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THE MEAN-VALUES OF ARITHMETICAL 
FUNCTIONS 


By F. V. ATKINSON and LORD CHERWELL (Ozford) 
[Received 12 May 1947; in revised form 25 September 1948] 


1. IntRopvuction. A principle of frequent occurrence in parts of the 
theory of numbers is the following: 


Let {A,}, {a,,} be two sequences connected by the reciprocal relations 
A, — 2, On» ay ~e > p(m)A.,,. 
m\n m|\n 
Then the numbers A,, have a definite average value, provided that 
(i) the sequences of values of n over which the average is taken is 
sufficiently regular, and 
(ii) the numbers a,, are sufficiently small. 


In applications A, is a given arithmetical function of n. A note- 
worthy particular case is given by taking A, to be 1 or 0 according 
as » has or has not some special property; this case is the problem 
of the distribution of integers of a given type in a given sequence. 

In this paper we consider how far this principle can be justified on 
elementary grounds. There are two reasons for such an investigation. 
In the first place results emerge which, though not without interest, 
are nevertheless elementary or else can be proved by elementary 
means with the help of known results such as the prime-number 
theorem. Secondly, the territory which we explore contains non- 
elementary regions, and here problems arise which, as we explain in 
§ 8, have been little investigated; to these problems we may hope to 
direct the attention of other workers. 

As regards the simplest case, that of the result 


N43 A, > Sa,/m as N- oo, (1.1) 
the sufficiency of the following pair of conditions, namely, 

Sa, = o0(N), > |a,| = O(N), (1.2) 
is an immediate consequence of a result of A. Axer.+ We establish, 


+ Axer (1). 
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in Theorem I, a more general criterion, of which another particular 


case is N 
> a, = o(N/log N). (1.3) 
I 


We indicate briefly the extension to the average of A, when m runs 
through an arithmetical progression. The results include many of 
the standard mean-value and distribution formulae of the elementary 
theory of numbers. 

In §§ 4-8 we consider a more recondite problem: the average of A, 
where n runs through a polynomial sequence; the results allow of an 
extension to the average of products of the A,. In § 9 we show by 
considering a simple case how such results can be proved when the 
sequence of values of n depends on the primes. 

Among cognate research we would mention 

(i) the recent work of van der Corput (2), Wintner (3), and 
Erdés (4), on the case when A,, is multiplicative; 
the extension of such results to algebraic number fields, by 
Axer (5), and Schleser (6); 
the analytic approach, which has yielded such results as the 
Tauberian theorem of Ikehara. However, this result, though 
‘deep’, does not include all that is obtainable by elementary 


methods. 


2. We now prove 


THEOREM I. Let A(n) be a positive non-decreasing function of n, 
such that X(n)/logn is non-increasing. Then (1.1) holds if 


N N 
Yan = o{N/MN)}, — log(N>* ¥ |ay|) = OW)}. 
n . n= 


Proof. We have 
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where & is an integer between 2 and VN to be chosen later. Then, 
by hypothesis, 


N N 

a 1) — ({—_ ekMN/R)| — *" pkMN) 

- o{ En! (Re oR } 
R 


where k is some positive constant. We have also, by Abel’s lemma, 


ELD <a aka 


7 <x 


=o) 
r—I 


pee See 
— AY(r—D/P 
since A(n)/logn is a non-increasing function and r = O(WvN). Hence 


_ _(NlogR 
% = hi} 


Now take the case A()/log N > 0. Then for any selected positive h 
we may take R so that 


log R > hA(N), log R = Ofhr(N)}, 
provided that N is sufficiently large. This gives 
Y,=0(Nh), = OfNexpf(k—hAN)}], 
so that N-"(| >, + Ds |) = Ofc} +0 (h). 
Since h can be chosen as large as we please, this implies that 


N-(3 + id >0 as N-o. 


IT i» = 
~<m< 
rr. rl 


It remains to consider the case A(N)) ~ clog N, where c is a positive 
constant. In this case we take R = [VN]. We have then 


N 
> an = o(N/log NY), 


m=1 


so that a, = o(n/logn), 
whence >. = o( p ia.) = o0(N). 
n<V4N+2 
Lastly we have, as before, 
>; = o[fA(N)}-N log RF] = 0(N), 
since log R = O(log NV). 








68 ¥. V. ATKINSON AND LORD CHERWELL 


This completes the proof of Theorem I. The conditions (1.1), (1.2), 
are obtained by taking A(n) = 1, log respectively. 
The extension to arithmetic progressions runs as follows: 


THEOREM II. Let a, b be two integers, and let X(n) satisfy the con- 
ditions of Theorem I. Then 
(m, b)a 


Way 4,302 5 Se, 
nN 34 m 
= mas! 
nma(modb) (m,b)\a 


provided that 
ya, = ofN/MN)}, — log(N+ ¥ |a,|) = ON)} 
n<N \ nN 
for all residue-classes c (mod 6). 


The proof is similar to that of Theorem I and is omitted, since the 
extension is covered in part by Theorem ITI. 


3. For many classical arithmetical functions the existence of the 
mean-value is guaranteed by the absolute convergence of the series 
> a,n-. If, as usually happens, the function a, is multiplicative as 
well, we get 

N ES 
HW 24s > 2 Sn gt = I (1+p—a,+p~ap:+...), 
with a corresponding modification for averages over arithmetical 
progressions. Thus, for instance, for Euler’s function ¢(n) and 
arbitrary s, 


8 N =" 8 
> (Oe) ~ es L Pal) TDC a) 
onan) pio p\(a,b) 
The case s = 1 is classical. The case s = —1 has been considered by 
Titchmarsh (7), Estermann (8). A more delicate example, which is 
still covered by Theorem I, is given by r(n), the number of partitions 
of n into two squares.} 

More generally, instead of stipulating that a, should be multiplica- 
tive, we may merely require that the associated Dirichlet series 
should admit a representation as an Euler product, of such a form 


as, for instance, 


dann = TT (1+44,97"), (3.1) 


dr 


+ Hardy and Wright (9), 268. 
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where q;, 72;--. are any sequence of mutually prime integers. We have 
then the formulae 


N 
An = II (1+-a,), a 2 An es II (1+q~"a,), 
n= q 


q\n 


where qg runs through the sequence q,,q>,...._ Absolute convergence 
of either side of (3.1) is a sufficient condition, as before. A similar 
result can be proved for the function 


A, = [J (1+4,)*, 
arin 


where «, is the highest power of g, which divides n. We get 


N a -1 
N- > A, > }—— 2} , 
»> : I] ( q,-— 5) ; 
provided that the infinite product is absolutely convergent and that 
[1a] <q 
Corresponding to any such sequence q,,q,..., of mutually prime 
integers we may set up analogues of the standard functions of number 
theory. Taking, for instance, a, = 1, we get the functions 
Qex(n) 22m) 


where w(n) denotes the number of integers of the sequence q).... 
which divide n, and Q(n) the corresponding number when multiple 
factors are counted according to their multiplicities. We get 


vay mw TT (142), 
— ar m 


—/) 
N-1 2QAn) _— qr é 
ay I] = ) 


provided in each case that the infinite products converge, and in the 
second case that 2 is not a member of the sequence q,..... 

Again, if we take a, = —1, then A, = 0 or 1 according as n is or 
is not divisible by any members of the sequence q,,..... The result is 


then that l 
6-2) 
qr 


ar 
is the asymptotic density of the numbers which are not divisible by 
any q, provided that the series } q~! converges. The cases of square- 
free,t cube-free,..., numbers are included trivially. A more delicate 


+ Hardy and Wright (9). 
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case is given by taking as the sequence q,,... the set of all pairs of 
primes which differ by 2. Similar results, of course, hold when the 
number runs through a general arithmetic progression. 

It is interesting to compare the scope of Theorem I with that of 
the much ‘deeper’ Tauberian theorem of Ikehara (10). The trivial 
cases in which the justification is by absolute convergence of the 


series a 
~t 
> a,n 
n=1 


fall within the scope of both results. We have, however, made no 
restriction on the sign of the numbers A,. Furthermore we do not 
require the condition that the function 

wo 

> a,0* 

n=1 
should be regular on the line R{s} = 1. 

4. In considering mean-values taken over polynomial sequences 

we need the following 


Lemma 1. Let «(m) denote the number of roots, distinct to modulus m, 


of the congruence f(n) = 0 (modm), 
where f(z) is a polynomial with integral coefficients and no double zeros. 
Then a(m) = O(m*) 


for any positive e. 

It will be sufficient to prove this result in the case when the 
coefficients of the polynomial f(z) have no factor common to all. 
Furthermore, since the function a(m) is then multiplicative, we may 
confine ourselves to showing that the result holds, uniformly, when 
m is a power of a prime p. 

If p is such that the simultaneous congruences 

f(n) = 0 (mod p), f'(n) = 0 (mod p) (4.1) 
have no solution, then it is knowny that a(p”) has the same value for 
all values of r, and is less than or equal to the degree of the polynomial 
f(z). This proves the required result. 

It remains to consider the cases in which p is such that the con- 
gruences (4.1) are simultaneously soluble. Now any common factor 
of f(n) and f’(n) must also be a factor of the algebraic resultant of 


+ Hardy and Wright (9), 96. 
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the polynomials f(z) and f’(z). This is an integer which is not zero, 
since f(z) has no double zeros. Hence there are at most a finite 
number of such primes. Let p be one of them and let p* be the 
highest power of p which divides the resultant. Let pw, v be any 
integers such that 

w>aa, l<svcp. 
Then any solution of the congruence 
f(n) = 0 (mod p’) (4.2) 


with r = »+v is also a solution with r = p». Let x be a solution with 
r =p. Then the solutions with r = »+v which correspond to z will 


be given by f(x+-yp") = 0 (mod p+”), (4.3) 


those solutions in y being reckoned distinct which are distinct (mod p”). 
Now (4.3) is equivalent to 
S(x)+-yp"f' (x) = 0 (mod p+”), 
so that, if y,, y, are any two solutions of (4.3), we must have 
(Y¥1—Ya)f'(x) = 0 (mod p”). 
If y > A, this implies 
Y1 = Y2 (mod p™), 
so that the number of solutions in y of (4.3) which are distinct 
(mod p”) is at most p’. If, on the other hand, v < A, then the same is 
true trivially. 
Summing up we have, for w > A, and uw < p < 2y, 

a(p) < pra(pH). 
and hence, for p < p < 2%, 

a(pP) < pa(pH). 
We now fix p» and let o take indefinitely large values. This completes 
the proof of the lemma. 


5. We can now prove 
THeoreEM III. Let {A}, {a} be two sets of sequences such that 


A® = Fa® (r = 1,2....,2). 


min 
Let fn) (r = 1, 2,...,2) 


be any | polynomials with integral coefficients, of maximum degree k 
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such that no two have a zero in common and none has a double zero. Then 
N 1 
N* > II 4% >, 
n=1r=1 
where C is a constant, provided that 
(i) |a”| is bounded for r = 1, 2,...,l and n = 1, 2...., 
co 
(ii) the series > |a|n-° are convergent for r = 1, 2,...,l, and some 
n=1 
o < I/k. 
Proof. We have 
N 1 
> TT Agi = > > am %m DL I 


n=1 r=] m<fi(N) m<fi(N) n<oN 
f-(n)=0(mod m,) 
(r=1,2,...,1) 


= 2 + 2% =i, + ie = 


Mm, M2...mjisN m,™M2...m>N 


Take first es which is, in fact, the dominant term. Let 
a(M,, Mg,..., M%) 
denote the number of roots, distinct (mod m, mg...m,), of the simul- 
taneous congruences 
f{n) = 0(modm,) (r= 1,2,...,1). 


We have then 
bs 2 = amy, My), ae + O10). 


M, Mzy...M 


nN 
SA(n)=0(mod m,) 
(r=1,2,...sl) 


peyeeey 


We show first that 
x(M,, Mg,...,M,) = Of(m, mg...m)*} 
for any positive « and any admissible set of values of m,, mg,...,m. 
Denote further by «,(m) the number of roots, to modulus m, of the 


. . e 
a f,(n) = 0 (mod m). 
Then we have 
mM, Mz...M 
a(M,, Mg,...,M) < St) ge 
1) Mq,.--, Ms 


where {m, mzg,...,m} denotes the least common multiple of the m’s. 


But the ratio 
My Mzg...1™/{My, Mg,..., My} 


is bounded. For any common factor of m,, m, is also a common factor 
of f,(n), f(r) for some n, and so must be a factor of the algebraic 
resultant of the polynomials f,(z), f,(z). But in view of the hypotheses 
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this resultant does not vanish, so that there are at most a finite 
number of possible factors common to any two m. It follows that 
2x(14, Mz,..., 1) = Ofaxy(my )org(1mq)...04(m)}, 
whence (5.1) follows by Lemma 1. 
We now write 
>... ¥ a a®...a a(m,, me,...,™); 
MyM2,...m=n 
N 
- => of + 000} 
Then the series > b,,/n is absolutely convergent, by comparison with 
the product of the / series 


Ss glt)ene-1 
r c= 
Zz Am, MN, 
M,=1 


each of which is, by condition (ii) of the theorem, absolutely con- 
vergent. Furthermore, since > |b,,|/n is convergent, we have 


> |5,| = o(N). 
nN 
Hence N- >, +> C, 


where C is a constant. 
It remains to consider = If 


M,Myg...m, > N, 
then at least one of the integers m,,mg,...,m, is greater than N™. 


Suppose that it is m,. Then, for fixed m, and n, the number of admis- 
sible sets of values of mg, ms,...,m is at most 


A f,(n)}A{fz(n)}...d{f(n)} = O(n*) 
for any positive e, from a well-known property of the divisor function. 
Also, for given m,, the number of values of n such that 


fi(n) = 9(modm,), n<N 
- { N ae a | 
is Of axl) i) = ofma( +2 l 


Using the fact that the a‘ are bounded we see that the total con- 
tribution to >, of the terms for which m, > Nis 


|a() (a. + 1 Ne ; 


of 
NU<m<fil(N) 
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But, if ¢ is sufficiently small, we have, by condition (ii) of the theorem, 


Ti+e (1) -1 A714 (1) |yy —o—le 
N z | @m, {my < N . > |Qn, |My 


— 
m>N¥ m>NM 


= o(7), 
Ne > lam) = ON {A(N)}7] 
m<fi(N) 
=: O{N¢+**} = o(N). 
Treating similarly the terms for which m, > N™....,.m, > N™, we 
have , 
>. = 0(Y), 


which completes the proof of Theorem III. 


6. A large number of mean-value theorems in the theory of 
numbers may be deduced from Theorem III. Thus, if, as before, 


Sim), fo(m),---s.f(m) 
are integral polynomials of degree not exceeding k with no common 
or double zeros, then ; 
IT Pf (n )f 
r= 


has a mean-value whatever the value of k, while, if o,(n) denotes the 
sum of the ath powers of the divisors of n, then 


II Ont Sl n ) } 


tae | 
has a mean-value, at any rate if —a+1/k > 1. 

However, the main significance of the theorem is that it separates 
out from the class of such problems a substantial proportion of those 
which are elementary. Thus, for example, we can say that the 
problem of the asymptotic density of sets of rth-power-free numbers 


of the form fi(n),.--, film) 


is an elementary one if r is greater than k, the maximum degree of 
the polynomials.+ 


7. The following investigation of a simple case will indicate the 
difficulties involved in extending such results beyond the elementary 
stage. 


+ The case k = 1 has been considered by Pillai (11), Mirsky (12). 
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THEOREM IV. The number of k-th-power-free numbers not exceeding 
N, of the form n'+-h, where h + 0, is asymptotic to 


rj 1 _ XP) 
mie p } 
Pp 
provided that k > 1, k > 2.+ 
Here x(m) denotes the number of solutions of the congruence 
v+h = 0 (mod m*). 

We confine ourselves to proving the case k = /; the case k >- 1 is 
substantially included in Theorem III. Denoting the number to be 
estimated by ¢(N), we have 

¥(N)= > Hs) 


1<n*+h<N stink +h 


= (s) l 
tine” i a st) 
1<n*+h<N 


= . 3 + pa = +e, Say, 


1<scM M<e<NiF 
where p(s) denotes Mébius’s function. 
We have %= > p(s)x(s){NM*s*+O0(1)}. 
1<s<M 
But, by Lemma I, x(s) = O(s*) 
for any positive «. Hence 
= NM DY pwls)x(s)s*+ o( > s*) 
1<s<_M 1<3<M 
= N™{1+0(1)} TT (1—x(p)p*}+ O(M"**). 
7) 
8. Turning to %, we have 
! | = l 
ve ee k sian 
1<nk+hoN 
= . > ” Ie 1. 
ISASN MN NECN 
We need the following 


LemMaA 2. For fixed h and k, with k > 2,h #0, we have 
1 = O(log X). 
n*k+h=As* 
0<As*<X 
Let x, y and 2’, y’ be two integer solutions of the Diophantine 
equation Ack = h+y* 


+ The case k = 1 = 2 has been investigated by Straus (13). 
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such that x’ > 2, y’ > y. We have then 
h(a"*—a*) = (xy’)k—(2’y)* 


and hence, since xy’—a'y > 0, 


2’—2 > bf $ (ay re'y) >} {  at-ra’r] 


> hI miniy’*, y'*-y,...; 9° 7} 
> hy 
= h-1(Axk§—h)** 

> Ax, 

for some positive constant A independent of A, x, y, x’, y’. Hence 
x’ > x(1+ A), 
which proves Lemma 2. 
We can now complete the proof of Theorem IV. We have 


t2 = O( & logN) = O(NM-*logN). 
A<NM-+* 
Hence 


Y(N) = NY TT (1—x(p)p-*) +0 (NY*) + O(M4+*) + O(N M-* log N). 
Pp 
When we choose M = N1*+1, Theorem IV follows. 


9. The extension of Theorem IV to cases for which k < 1 would 
seem to depend on proving results of the type of Lemma 2 for more 
general Diophantine equations. The main existing result in this field, 
the Thue-Siegel theorem,+ holds for any particular Diophantine 
equation of degree not less than 3, whereas we should need one 
holding uniformly in the coefficients. 

We have investigated numerically certain cases for which k < l, 
and it is evident that the results show no sign of breaking down. 
We have taken k = 2, 1 = 3, 4, 5, h = 1, 2, 3, and N = 250,000. 

Form Observed Calculated 
n+] y 27-3 

n§ +-2 j 59°5 

ni +3 E 44-7 
ni+l 2 21-6 
n*+-2 16-8 
nit 3 10-5 
n+] j 7 
n5t2 10 
ne+3 8-5 

+ See, for instance, Landau (14), Vol. ITT. 
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10. The results of Theorems III and IV hold with very slight 
modification when ” runs through the primes instead of the natural 
numbers. The principles involved will be sufficiently clear from an 
investigation of the simplest case. 


THEOREM V. The number of k-th-power-free numbers not exceeding 
N, of the form p'+-h, where k >1, hh £4 0, and p is a prime, is asymptotic 


to 
at x(P) 
wile a aa 
log N¥! I] | p*\(p— Dp 
p 
(h,p)=1 
where x(m) has the same meaning as in Theorem IV. 


Let us denote the number concerned by 6(N). Then 


aN)= > > (5) 
1<pl+h<N s*\p!+h 


= > ws) F&F 1 
—T pi hecotmnod &) 
and in this sum we may assume (h,s) = 1, since there are at most 
a finite number of primes p and values of s for which 
pi+h = 0(mods*) ((h,s) > 1). 
Further, if (h,s) = 1, the congruence 
p+h = 0 (mod s*) 
is equivalent to one or other of the congruences 
p=h,(mods*) (r= 1,2...., x(s)), 
where hy, hg,... are certain integers dependent on s and prime to it. 
We now use the following two known results, following Ester- 
mann (15). 
(i) If (a,b) = 1, then for fixed a, b, 
1 n 
pedizdan #0) To 
as n> odo. This is the prime-number theorem for an arithmetical 
progression. 
(ii) For fixed «, with 0 < a <1, and 0 <b < n%,n D> 2, we have 


n 
2 fs a Fees :) 


pon 
p=a(mod 
uniformly in n, a, b. This has been deduced by Titchmarsh} from a 


result of Viggo Brun. 
+ Titchmarsh (7). 
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We therefore have, for fixed s prime to h, 
(N—h)¥ 


a — a 
p= n1, Pong $(s*)log(N —h)™ 


p 
x(s) Nu 
and hence oie . l1w~ Js! 7 a So Ni" 


pit+ * 0(mod s*) 


Again, taking « = } in (ii), we have for variable s such that s* < N¥%, 


{ x(s) Nil ) 
paren = Veet) log NM 


p+h= FP s*) 
and lastly, for arbitrary s, 
> l< > 1 = Of{x(s)(NMs-*+1)}. 
pt+h<Nn nit+h<oNn 
p'+h=0(mod s*) n'+h= =0(mod s*) 
Collecting these results we have 
rut 
x(s) 


ON) = = M8) ok) log wid) Ts 


s< N1 2k 
(h,s)=1 


+0(Nu (s)s-*)+O( >. ,_X(8)) +O(1), 


Niikle 


where €,, €,,... are numbers which are uniformly bounded and which 
tend to zero as N + oo. Now the series 


i> 9) 
(hy 8) 
is absolutely convergent, as is an from its product representation 
Tl | ee 
u pe\(p—1)J 
(h,p)=1 


since, by Lemma 1, x(p) = O(ps) 


for any positive «. It follows easily that 


x9), oe 
Kian AR) 0 (A 00). 


We have also 


( x(P) = 
py 
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Lastly, we note that 
logN > x(s)s*+0 (N>oo), 
s> Ninn 


since x(s) = O(s*) 
for any positive «, and that 


> x(s) = O(NG+e) 


s< Nut 
= o(N™) 
since k > l. 

This completes the proof of Theorem V. It is fairly clear that 
Theorems III and IV may be extended in the same way. 

Similar results would seem to be possible for conjugate problems, 
such as the representation of a number as the sum of a kth-power-free 
number and the /th power of a prime.7 

This paper has its origin in a monograph by Lord Cherwell extend- 
ing the scope of a method used by him in calculating the distribution 
of the intervals between prime numbers.{ Of the present paper 
Theorems I-II and their mode of application are due mainly to him ; 
the extensions discussed in Theorems ITI—V are due mainly to F. V. 
Atkinson, who also takes responsibility for the correctness of this 
draft. 

+ See Estermann (15). 
t Quart. J. of Math. (Oxford) 17 (1946), 46-62. 
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A REMARK CONCERNING THE CHARACTERISTIC 
ROOTS OF THE FINITE SEGMENTS OF THE 
HILBERT MATRIX? 


By OLGA TAUSSKY (London) 
[Received 10 June 1948] 


1. Tue infinite matrix (1/(i+-j)) and its associated quadratic form 
have been studied for a long time. Hilbert (1) and several others 
proved that it is a bounded matrix, i.e. that 


2rd 2 [dace 


The i lity P< 1 
e inequality >. > f/>4 iit (1) 


i,j=1 
is usually called Hilbert’s inequality. 

It is known that the greatest characteristic roots A, of the finite 
segments (1/(i+-))) (i,j = 1,...,m) tend to 7 when tends to infinity. 
The convergence of the sequence A,, is, however, slow. It can further 
be shown, by rather complicated arguments, that 7 is not a charac- 
teristic root of the infinite matrix. 

It will be shown in § 3 that 


A, = m1 O = 
\ log n 


1 ee 
(=) (6,4 = I,...,@) 


is symmetric, its greatest characteristic root is§ 


\, = max > il >. x3, (2) 


where the vectors {2,;} are arbitrary. 


\ + 
y* 


2. Since the matrix 


+ This work was carried out when the author was employed at the Institute 
for Numerical Analysis of the U.S. National Bureau of Standards. 

{t The problem of studying the greatest characteristic roots of the segments 
of the Hilbert matrix was suggested by W. W. Sawyer and was communicated 
to me by the Oscillation Sub-committee of the Aeronautical Research Council 
in Great Britain. With collaborators Mr. Sawyer has attempted to find an 
empirical formula for (7—A,)~1 (see Nature, March 6, 1948, 361). 

§ When no special reference is made, all sums are carried out over every 
suffix occurring in them from 1 to n. 


[Quart. Journ. of Math. (Oxford), Vol. 20, June 1949, pp. 80-83] 
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It will be shown that 


Df Zt {4 ees] @ 





if {x,} = {1, 1/v2,..., 1/vn}. 
Since > x? = logn+ O(1), 
it will be enough to show that 
XU, %; 
> am = nf{logn+O(1)}. (4) 


From (2) and (3) it follows that a number less than z could not be 
a limit point of the sequence A,,. Hence limA,, = 


3. Lemma. If p, q are integers such that 0 < p < q, then 


~ ee, See dx ~~ 
2 fast J me x(p+q—x)} wa Shs 


Proof. Denote by p the integral part of }(p+-q+-1). Consider the 
difference t+1 


meme NE | cea 
(Xp+a—OF J [e@+a—=)) 
Since the integrand is decreasing in the range p <t < p, the 
difference e(t) is always positive in this range. 
By the first mean-value theorem we have 
e(t) = [t(p+q—b]4*—[2'(p+9q—2')[4, 
where t << 2’ = t+6’ <t+1. Applying this theorem again we find 


that 
e(t) = —36'[t'(pt+q—t’)]"[p+9q—2¢), 

where t < t’ < 2’. We have 
pes De A 
2t'(p+q—t’)t 2t'(p+q—t') ~ 2t'(p+q—t')E 
Now, in virtue of the relations 

pst<t<tHl<p, 0<@ <1, 

1 

t(p+q—p)"” 


3695.20 G 





ee) = 








we have e(t) < 


bo 
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and so, summing, 


p—t , 
= 8) < ppt + O(1)] 
= Ole) = O14") 
since 4q<p<q forall p. 
We now observe that 
q 
i l dx et 
— — = 2 e(t) +-O(q-) 
i [s(p+-q—s)}! | [x(p+q—z2)}* P2 ; 
= Of¢-"), 
If in this lemma we put p = 1, g = 7, we get 
~~ 1 , dx 
5 < = —- == O(r-*). 
p [s(r+1—s)]# | [x(r+-1—z) |} 
4. Proof of (4) 
We write 
v2 has ue Ley , S Uj; 
, ae 2 iit at 2 S i+) 
=1i+j=r-+ r=2 i1+jJ=rtn 


Denote the first sum for {x,} = {1, 1/v2,..., 1/Wn} by >, the second 
by >.. It will be shown that 


>, = z{logn+O0(1)}, >. = (1). 


a | 
In virtue of the special case of the lemma we have 


r 


- = = “. ¥ dx 1\ 
N me . . <a =e o_ 
a oP rs 1] Bo [ s( =e “a [a(r+-1—a)]* ol) 
i 
= 


= 2faresin,/(_2 (2) fs) 
5 2 4—2 aresin /(—}) , 


Since 





we have 
n 





- “ { —I%aresi oe a 
:# - > aa 2aresin,/(—)} + O(1). 


r=1 
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Since = 


pa 79 = mlogn+O(1), 


n 
and 


4 r 
——aresin 2m == ((1), 
r+1 r+1 
r=1 
we thus obtain 


, = a{logn+O(1)}. 
By the lemma we have 


1 
— [s(r+-n—s)]}* 


= 1 h dx l 
af? aA iaenept o(%) 


— 1 


n d 
rtn | pepae ep Om) 
1 


. r 
——}4n—2arcsin | 
~ rn n+r}} 


= O(1), 


, 1 
since b = 
r+n 


—— = log 2n—log(n+2)+O(1) = O(1) 


; : r 
and since aresin —— 
n+r 


is bounded. This completes the proof. 
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See, for instance, Hardy, Littlewood, Pélya, Inequalities (Cambridge, 
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A VARIANT OF A CLASSICAL ISOPERIMETRIC 
PROBLEM 


By A. 8. BESICOVITCH (Cambridge) 
[Received 12 May 1948] 


Tus article is devoted to a solution of the following 
Prosiem. Find the maximum value of the area bounded by a simple 
curve of length X > 2a if the radius of the largest inscribed circle is 1. 
This is a variant of the classical isoperimetric problem. Apart 
from its attractive simplicity, the interest of the problem lies also 
in the fact that the usual methods of variations of the boundary do 
not seem to lead easily to a solution. 


1. Definitions and notation 
For simple curves and for domains bounded by them the same 
notation will be used. The length of a curve I will be denoted by AT’. 
If T is simple and closed, then the area of I will be denoted by |T’|. 
The above problem can easil, be shown to be equivalent to the 
problem: 


If p is the radius of the largest circle inscribed into a simple closed 
curve I’, then, for given values of || and AT, find the lower bound of p. 


Griinwald and Turan} have proved that 
1 |r 
i seas seal te 
tad 4nr+2 AT’ 
and have announced that Griinwald and Varzsonyi had proved that 
1 iF 
ere’ 
Their proof has not been published and the first proof of this 
inequality was given by Santalo.t In this paper, by an entirely 
different method, I find the lower bound of p (which is different 
from |I'|/AT) and also I find the class of all the curves [' for which 
the lower bound is attained. 
Let [I be a simple closed curve of length A. Choose an origin O on 
r and a positive direction of arc. Take an arbitrary point Me¢«T 
+ Acta Litt. Sc. Reg. Univ. Hungaricae, 8 (1936-7), 238. 
t Revista Un. Mat. Argentina, 10 (1944, 5), 155-62. 
[Quart. Journ. of Math. (Oxford), Vol. 20, June 1949, pp. 84-94] 
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and let s be the length of the are described in the positive direction 
from O to M. I shall denote M simply by s. By 3,8, or by (8,,8,) 
I shall denote the are of I described in the positive direction from 
8, to 8,, so that, if O does not belong to s,8,, then 8s, < 8,. In this 
notation 8,8, is the complementary arc to 8, 8, so that 8,8,+8,8,=T. 

A circle c is said to be inscribed into T if the interior of c is included 
in the interior of [ and if the boundary of c has at least two points 
in common with [’. All the points in common will be called points 
of contact whether or not at such points a common tangent exists. 
An are s,8, of T such that the points s,, s, belong to c, but no other 
point of s,8, does, is called a pocket of T onc. The domain partly 
bounded by s,s, and cut off from T by c is also called a pocket and 
is denoted [s,8,c] or [(s,,8),c]. There may be many circles sub- 
tending s,8,. If no ambiguity can arise about the circle subtending 
818, then the pocket may be denoted by [s, 8,] or [(8,, 82)]. 


2. Lemma 1. If 8,8, is a pocket of T on c and if an interior point s’ 
of 8,8 is a point of contact with another inscribed circle c’, then all the 
other points of contact of c’ with T belong to 8, 85. 


A proof is left to the reader. 

A circle inscribed into [, all of whose points of contact belong to 
the are 8,8, at least one of them being an interior point of 8, 85, is 
said to be inscribed into s,8,. Take a positive number a <7. A 
pocket s,s, of T is said to be a small or a large pocket according as 
As, 8 is <a or >a, so that, if As, 8, = a, 8,8, is both a small and 
a large pocket. 


3. Lemma 2. Given a pocket s’s” (s’ < 8") of T on cy and a small 
positive number 8 less than half the length of the chord 8's”, there exists 
a circle c, inscribed into s's" and having points of contact within 8 of 
each of the points s’, s” (which includes also the case of one or both of the 
points s', 8”, being points of contact). 


If the circle c’ = c(o’,r’) passing through the points s’, s” and 
touching the are (s’+-5,s”—8) does not include in its interior any 
points of the ares (s’,s’-+8) and (s”—8, 8”), then c’ can be taken for ¢,. 
Otherwise let r” be the distance from o’ to the more remote one of 
the ares (s’,s’+8), (s”—8,s") if they are not equidistant from o’, 
and to each of them if they are. In the second case the circle c(o’,r”) 
may be taken for c,. In the first case c(o’,r”) touches one of the arcs 
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and crosses the other one. Obviously there is a circle homothetic to 
c(o’,r”) with respect to the (or a) point of contact with the first arc 
and touching the second one. This circle can obviously be taken for c,. 


DEFINITIONS. An inscribed circle having a pair of points of contact 
81,8 such that As, 8, > «a and As,s, > « is called an ‘a-circle’ (it may 
have other points of contact). 

The aggregate C(x) of all «-circles will be called the ‘x-component’ 
of 1. C(a) is obviously a closed set of circles. 

An inscribed circle on which T forms more than two large pockets 
each containing points of C(x) is called a ‘ramification’ circle. 


By Lemma 2, if ¢ is an a-circle and [8, 8.,¢] is a large pocket, then, 
if As,s, >a, there is an a-circle inscribed into the pocket. If 
As, 8 = a, there is at most one a-circle inscribed into the pocket: it 
is the circle through s,,8, and touching 8,8, at an interior point. 
Even if there is no such circle, [s,s,,c] need not be free from points 
of C(x), for there may be circles through s,s, other than c that 
penetrate into the pocket deeper than the circle c. 


Lemna 3. If T has k ramification-circles, then (k+-2)a < A. 
The proof is left to the reader. 


4. Lemma 4. Given two a-circles c, cy (Fig. 1) joined by ares 8, 8, 8384 
(8; < 8 < & < 84) (expressing it more clearly, 8, 8, is a pocket on Cy 
and 838, on c) and an a-circle c’ having contact with 8,8, at 8}, 85 
(8; < 8, < 83 < 83 83—8, > a) and no contact with 8,84, then there 
exists a ramification-circle c, on which 8,8, has a large pocket and 
which has contact with 8584. 


For let c” be an a-circle subtending the largest are s{ sj on 8,8, 


- ” 


(8; < 8s] < 8, < 8) < 83 <8,). The existence of such a circle follows 
since C(x) is closed. If it is inscribed into s$s{, then by Lemma 1 it 
touches 83 8; between s, and s, and thus it is a required ramification- 
circle. If, on the other hand, c” is not inscribed into 8} s{, i.e. has no 
contact with the interior of sjsj{, then we distinguish several cases: 
(i) 8; < 8},83 < 8,. By Lemma 2 and by the definition of the circle 
c” there exists a circle through sj, 83 inscribed into s}s{ which is a 
required ramification-cirele. (ii) s, = 8{,83 = 8. By Lemma | there 
does not exist an inscribed circle through s/’, sy’ near s{, 83 and such 


that at least one of the inequalities si’ < sj, 8; < sj is satisfied. Then 
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again by Lemma 2 there exists a circle through s{, 83 inscribed into 
838; that again gives a required ramification-circle. (iii) the cases 
8, < 8], 83 = 8 and s, = 8{, 83 < 8, lead to the same conclusion. 

We say that an inscribed circle c’ is between c and Cp if all the 
points of contact of c’ with T belong to the arcs s,3, and 838,. If 
two points of contact of c’ belong one to each arc s,s, and 8384, we 
call the chord of contact a main chord. 


CoroLLary. If there is no ramification-circle between c and Cy, then 
the main chords of contact of «-circles between c and ¢, form a monotone 


set. 


Fie. 1. 


For by Lemma 4 any a-circle between c and c, touches in this case 
each are 8, 8, and 8,8,. By the ‘monotony’ we mean that, if s.s4, 8; 84 
are main chords of contact of two circles and s, < 8, < 8] < 8», then 


8, > 8, > 8} > 85. The result follows immediately from Lemma 1. 


5. Denote by C the boundary of C(«). If T is not contained in C, 
then [—C is a set of open ares. 

Lemma 5. Any arc of [!—C is a pocket on an a-circle. 

Suppose that the lemma is not true, and that an are s,s, of [—C 
is not a pocket on any a-circle and thus has its end-points on two 
different «-circles cy, and c (Fig. 1). If there are more than one 
x-cireles through s,, we take for cy one for which the end s, of the 
pocket starting at s, is the nearest to s,. As neither cy nor any other 
x-circle through s,,8,, if such exist, is inscribed into 8,84, then by 
Lemma 2 there is a circle inscribed into s,s, and touching 8,8, at 
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points arbitrarily near s,,8, and not at s,. Such a circle is obviously 
an a-circle, which is impossible since no a-circle has contact with the 
interior of 8, 85. 

COROLLARY. C(a) is a continuum. 


6. Given an a-circle c and a large pocket s, 8, (8, < 8.) on it that 
is free from ramification-circles, the «-circle c’ inscribed into 8,8, 
that does not separate C(a«) is called the last a-circle of the pocket. 
Let s}, 8; be the points of contact of c’ with s,s, nearest to 8,, 85 
respectively. By Lemma 4 every a-circle inscribed into s,s, has 
contact with each of the ares s,s; and s,8,. Any arc 8's” of s,8,—C is, 
by Lemma 5, a pocket on an a-circle. As there is only one circle c, 
through s’, s” touching s,s, it is the only a-circle subtending s’s” 
Replace s‘s” by the circular arc of the pocket [s’s”,c,], and do the 
same with every arc of s,8;—C. Denote by 8,8; the are s,s, thus 
modified. Similarly the are s,s, is defined ; s,s; and s,s, are simple 
ares without points in common, and every point of each of the arcs 
is a point of an «-circle. When a point moves along s, 8, from s, to 8}, 
the a-circles to which the point belongs occur in the same order as 
when a point moves along 8,8, from s, to 8}. 

The figure A consisting of the circles c, c’ and of the part of the 
plane between these circles and the ares 8,8}, 838, is bounded by a 
simple curve and is equal to the sum of c and of the part of C(x) 
included in the pocket [8, 85, c}. 

If the pocket s,s, is not free from ramification-circles, then we 
take for c’ the ramification-circle inscribed into s,s, and nearest to c. 
We define the points s{, 8;, the arcs s, 8, 8,8, and the figure A as 
before. The figure A consists of the circles c and c’ and of the part of 
C(a) between c and c’. 

We now conclude without difficulty that C(a) is a simply-connec- 
ted closed domain and that its boundary C is obtained from [ by 
replacing arcs of [—C by the arcs of a-circles subtending them. 
Hence 1=AC <AT =A. 











7. A simple figure A on C(x) is defined as a closed set of «-circles 
satisfying the following conditions: 

(i) A is a closed simply-connected domain; 

(ii) the boundary of A (denote it also by A) is a dni closed 
curve of finite length; 
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(iii) there is a pair of points M, M’ dividing the curve into two 
arcs MM’ and M’'M so that every circle of A has contact with each 
of the two ares. A circle passing through M or M’ is considered as 
satisfying this condition. 

Lemma 6. (i) Jf AA < 27, then |A| < (AA)?/47; 
and (ii) of AA > 2n, then |A| < AA—z. 

(i) This is a classical result. 

(ii) Let PQ (Fig. 2) be the arc of A of length 7 that contains the 


point M and whose end-points belong to the same a-circle. The arc 
P’Q’ containing M’ is defined similarly. Such ares can always be 
defined and may be assumed to be non-overlapping. It is easy to see 
that, given « > 0, we can define two monotone sequences 
P=P,,F....P,=P and Q@=Q,.9,....9,=¢ 

of points of ares PP’ and QQ’ such that, for any 0 <i < n, P; and 
Q; belong to the same a-circle and 

A~ P,P; <<, A ~ Q;-19; < «€. 
A number of consecutive points of the same sequence may be 
coincident. Any segment P,Q; is less than, or equal to, the diameter 
of the a-circle through P,Q; and thus < 2. We have 


A = PMQP+Q'M'P'G + ¥ P1Q-10:P,P-. 
1 


Hence 


Al < }n+4n+(1+e) > (A ~ P..P+A~ Q;-19%); 


A! < w+(1+e)(A ~ PP’+A ~ QQ’) 


and finally 
|A| < wt+A ~ PQ’+A ~ QQ’ = AA—z, 


which proves the lemma. 
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By examining the proof of Lemma 6 we arrive at 


Lemma 7. If a simple figure A satisfies the condition |A| = AA—z, 

then 
(i) the arcs PMQ and Q’M'P’ are circular of radius 1; 

(ii) all the circles of A are of radius 1; 

(iii) each of the arcs PP’, QQ’ has one and only one point in common 
with every circle of A. 

This includes also the case when one of the ares PP’ and QQ’ 
reduces to a point. 

It will be shown that these conditions are also sufficient. 


8. Our problem now is an estimation of |C(«)|. We shall arrive at 
it by representing C(a) as the sum of a finite number of simple figures 
with a certain degree of overlapping. 

If C(«) has no ramification-circles, then C(«) itself is a simple figure. 
For, if C(«) consists of a single circle, this is obvious. If C(«) has 
more than one circle, let c be one of them, and c’ be the last circle 
of a (or of the) large pocket on c. The whole of C(«) apart from c’ is 
then inscribed in a single large pocket on c’ and thus is a simple 
figure. Hence, if C(«) has no ramification-circles, then 

|C(a)| <I—n. 


Lemma 8. If the conditions (i), (ii), (iii) of Lemma 7 are satisfied, 
then |A| = AA—z. 

For let a circle c of A touch the ares PP’, QQ’ (Fig. 2) at the 
points II, Q respectively, and a neighbouring circle c’ at II’, 0’. 
The common chord of c and c’ lies between the chords IQ and IT’Q’, 
and, as c’ tends to c, the common chord tends to the common 
diameter, from which we conclude that TQ is a diameter of c. Thus 
each circle of A touches the arcs PP’, QQ’ at the ends of a diameter. 
The line of centres 0, o’ of c, c’ is perpendicular to the common chord, 
and thus nearly perpendicular to each of the diameters IQ, IT’Q’. 
Hence the area of the quadrilateral IIII’Q’Q is asymptotically equal 
to the sum of chords IIIT’, QQ’, from which the lemma follows. 

Observe that under the above conditions the line of centres of the 
circles of A is a simple arc y of length }AA—7a whose tangent at every 
point o is perpendicular to the diameter of contact of the circle of 
centre o of A, and the boundary of A is the locus of points distant 
1 from y. 
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On any normal to y the points distant 1 from the foot of the normal 
belong to the boundary of A. 





Let o be an interior point of y and o’, o” two other points of y 
near o and on different sides of 0. Join o’o” by the two smaller arcs 
of radius 1. The point o is on the boundary or inside the figure 
formed by these two arcs; for otherwise at least one point on the 
normal at o, distant 1 from o, would be within less than 1 from at 
least one of the points o’, o” and thus would be an interior point of A. 
Hence at every point of y the upper curvature does not exceed 1. 
An immediate conclusion from this is that the chord of any arc of y 
of length z is not less than 2. 

I shall now prove that the chord of any arc of y of length greater 
than 7 exceeds 2. Suppose that 0, 0’, o” are three points of y in order 
of increasing ares and c, c’, c” the circles of A with centres at these 
points, and that c and c’ touch each other from outside. Then c’” 
neither meets nor touches c: that is, if the chord oo’ = 2, then chord 
00” > 2. Let now oo” be any are of y of length > z. The part of the 
are oo” of length z starting from o subtends a chord of length '> 2. 
Hence there exists a point 0’ between o and o” such that the chord 
oo’ = 2, whence the chord oo” exceeds 2. Thus we have arrived at 


Lemma 9. If a simple figure A is such that |A| = AA—z, then all 
the circles of A are of radius 1, and the line y of centres of the circles of 
A is a simple arc of length }AA—z having a continuous tangent. The 
upper curvature of y does not exceed 1 at all points, and the chord of 
any arc of y of length greater than 7 exceeds 2. 


Suppose now that C(a) has ramification-circles. In § 6 I defined 
two kinds of simple figures corresponding to any a-circle c. Define 
now all such figures corresponding to every ramification-circle, and 
let A,, Ag,..., A, be all the different ones. We know that the largest 
circle inscribed into [ is a unit-circle. The points of contact of a 
unit-circle with T cannot all lie on an are of the circle of length less 
than 7, for then it would be possible to inscribe into [ a circle of 
radius greater than 1. Thus any inscribed unit-circle is an a-circle. 
Assume that the simple figures are enumerated so that A, contains a 
unit-cirele, and Ag, Ag,..., A,, start from ramification-circles C9, ¢3,..., Cx, 
of A, respectively. Not all of these circles are distinct. The figures 
Aj, +19+++) My, Start from the ramification-circles ¢,, .,,..., Cy, Of Ag,..., Ay, 
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respectively, and so on. We obviously have 


Cla) = A+ ¥ (ey) 


and [C(a)| = |A,| + ¥ |A,—e;|, 
and it is easy to see that 7 


AC = AA, + 3 (AA,—Ac)). (2.1) 


Writing AA; = 2zp; for those i for which AA; < 27, and 1; for the 
radius of c; we have 
|C(x)| < AA\—a+ >’ (AA;—2—a1}) +7 >” (pi?—17), (2.2) 
l= AC = AA,+ >’ (AA;—2ar;)+22 >” (p;—1;) (2.3) 
where the summation &’ is extended over those values of ¢ for which 
AA; > 2x, and =” over the rest of i > 1. Hence 
O(a)| <l—a—a S’ (1—r,)*@—a Y” (2—p,—r(p;—1;) < L—m. (3) 
Now ['—C(a) is the sum of non-overlapping small pockets that 
are included between [ and C. For any small pocket we have 
[8 8,€]| < (As, 85)? < aAs, 8. 
Hence |T—C(a)| < aA 
and, by (3), IP'] < l—a+oad < A—m+ aA. 
This being true for any 0 < a < z, we have 
IT'| <A—z. (4) 
Thus we have arrived at 

THEOREM 1. Jf the radius of the largest circle inscribed into a simple 
closed curve 1 of length X > 2m is 1, then the area bounded by T is less 
than or equal to A—7. 

9. Our problem now is to find all the figures [' for which the area 
attains the maximum value. 

LEMMA 10. The area between two circular arcs of length 1,, 1, and of 
radii 11, T, respectively, both from the same side of the common chord, 
is less than \l,—l,|max(r,, 79). 

A proof is left to the reader. 

Lemma 11. Jf |8,8,,c] is a small pocket, then 

|[$,¢]| < As,s,—t 


where t is the length of the arc of c belonging to the boundary of the pocket. 











ON AN ISOPERIMETRIC PROBLEM 93 


For a given As, 8, and for a given are ¢ of c the area of the pocket 
[s, 8),¢] attains its maximum value when s, s, is a circular are. It is 
easy to see that in this case the radius of s,s, does not exceed 1, and 
the lemma follows from Lemma 10. 


Lemma 12. If |[—C(«)| > 0, then |T'| < A—z. 
For, by Lemma 11, |T—C(a«)| < A—I, 
and, by (3), IT’ | < A—z. 
Thus, if |['| is to have the maximum value, [ must coincide with 
O(a) for any « < zw, and consequently A with 1. 


Lemma 13. If C(«) is not a simple figure, then 
|C(a)| << l—n. 
For suppose, on the contrary, that C(«) has ramification-circles, and 
that |C(«)| = l—7a. Then, by (3), 
>’ (l—1,)? + 2” (2—pi—r7-eg—rs) = 9, 

which can happen only if there is no A; for which AA; < 27 and if 
r; = 1 for i = 2,3,...,8. From (2), (2.1), (2.2), (2.3) we see that, if 
C(«)| attains the value 1—7z, then |A;| = AA;—z for every ¢ = 1...., 8. 
This requires that every A; should satisfy the conditions of Lemma 7. 
Let ¢ be a ramification-circle of C(a) and A; ,A;,,A;, be three of the 
figures expanding from c into the pockets [s}s{,c], [8383,¢], [83 83, ¢]. 
At least two of these pockets are subtended by ares of c of length < 7, 
say the first two. Then the complementary ares of c, each of length 

- a, belong to the boundaries of A;, and A;,, and thus A;,, and A,, 
do not satisfy the conditions of Lemma 7. This contradiction proves 
the lemma. 

Lemmas 12, 13, and 7 show that the equation || = A—7 requires 
that [T° should coincide with C(«), that O(«) should be a simple 
figure, and that all the circles inscribed into [ should be of radius 1. 
Lemma 9 gives further conditions that should be satisfied by the line 
y of centres. 

THEOREM 2. Given any rectifiable simple arc y satisfying the con- 
ditions: 

(i) y has a tangent at all points, which varies continuously as the 
point varies; 

(ii) the wpper curvature of y < 1 at all points; 

(iii) the chord of any arc of y of length > m is greater than 2; 
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then the locus T of points distant 1 from y is a simple curve such that 
the largest circle inscribed into it is of radius 1 and that the area of |T| 
attains the limit given in Theorem 1. 


The remark above shows that all the curves [ for which the upper 
bound of area is attained belong to the class of curves defined in 
Theorem 2, and Theorem 2 states that all members of the class satisfy 
this condition. 

Let A, B be the end-points of y and IT, Q the points on the normal 
at o distant 1 from o (Fig. 2). I leave it to the reader to prove 
that, as the point o moves on y from A to B, the points II and 
Q describe simple arcs PP’ and QQ’, one of which may reduce 
to a point. The arcs PP’, QQ’ and the semicircles on the chords 
PQ, P’Q’ form the locus [. It is easy to see that the only circles 
inscribed into I’ are the unit-circles with centres at points of I. 
Obviously all conditions of Lemma 7 are satisfied, which proves the 
theorem. 














FACTORIZATION OF EVEN GRAPHS 
By R. RADO (London) 
[Received 19 June 1948] 


1. Let T be a finite graph, i.e. a finite set of objects, called nodes, 
in which a symmetric, non-reflexive, binary relation a = 6 holds 
between certain pairs of nodes a, b. If a = b, then (a, 6) is an edge 
of , and the nodes a and b are said to be joined by an edge. Either 
of the nodes a, b is incident with this edge. 

A factor of T, more fully described as ‘factor of degree one’, is a 
set F of edges which has the property that every node of I is incident 
with exactly one edge of F. All factors considered in this note are 
of degree one. 

Tutte (1) proved a very general theorem stating necessary and 
sufficient conditions for the existence of factors of any given finite 
graph. His conditions involve the connected components of certain 
subgraphs of [. In the case of even graphs, i.e. of graphs in which 
every closed cycle of edges consists of an even number of edges, a 
completely different criterion can be deduced from a theorem of 
P. Hall (2). A set S of nodes is called independent if no two nodes 
of S are joined by an edge. Hall’s criterion is as follows: A finite 
even graph T possesses a factor if and only if, given any independent 
set of any number k of nodes a,,, there are k distinct nodes each of which 
is joined by an edge with at least one of the nodes a,, (condition H). 
The object of this note is to prove (Theorem I) that condition H is 
necessary and sufficient for the existence of a factor in the case of 
even graphs I in which the set of nodes has any arbitrary cardinal, 
provided only that [' is locally finite, i.e. that every node is incident 
with only a finite number of edges. The necessity proof is almost 
trivial. The sufficiency proof will proceed along the following lines. 
The proposition to be proved will be replaced by an apparently 
stronger proposition (Theorem II). Then the case of a general T° 
will be reduced, in several stages, to the case of a finite [ and then 
proved by induction. 

Theorem II will be re-stated (Theorem IIT) in terms of systems 
of abstract aggregates and is then seen to generalize a result of an 
earlier note (4). 


[Quart. Journ. of Math. (Oxford), Vol. 20, June 1949, pp. 95-104] 
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2. We use the notation and definitions of the preceding section. 


THEOREM I. A locally finite even graph T possesses a factor of degree 
one if and only if the following condition holds: given any independent 
set S of any finite number k of nodes, there exist k distinct nodes each of 
which is joined by an edge to at least one node of S. 

I call condition H the condition mentioned in this theorem. The 
necessity of condition H follows at once from the fact that every 
node of S is incident with an edge of a factor F, and that the second 
‘end-points’ of these edges are mutually different from each other. 
Here we do not require the hypotheses that T' is even or locally finite, 
nor that S is independent. 

Before turning to the sufficiency proof, I want to show, by means 
of suitable examples, that condition H is not sufficient for the 
existence of a factor if T is locally finite but not even, nor if T is even 
but not locally finite. Let T, be the graph whose nodes are a, b, c 
and whose edges are a = b=c=a. Then YT, is locally finite and 
satisfies condition H but does not possess a factor. Let T, be the 
graph whose nodes are a, b,, €,, by, Co,... and whose edges are 

a=b=c, (v= 1,2....). 
Then [, is even and satisfies condition H but does not possess a 
factor. 

3. We define a generalized graph as a set of objects, called nodes, 
in which a symmetrical binary relation a = b holds between certain 
pairs of nodes, where either a = b or a ~ b. Letters a, b, c, x, y 
denote typical nodes. Generalized graphs are obtained by adding to 
the existing edges of a graph I’ any system of loops each of which 
is incident with exactly one of the existing nodes of . From now on 
the letter [ denotes generalized graphs. The term edge comprises all 
pairs (a, b) satisfying a = b, those with a + b as well as those with 
a=b. A set S of nodes of [ is called independent if a = b (ae S; 
b € 8), i.e. if no node of S is incident with a loop and no two nodes 
of S are joined by an edge. A factor of T is a set F of edges which 
has the property that every node is incident with exactly one edge 
of F. The generalized graph I is called event if the relations 


a =a, =4,=... = 4,_, = a, = A, 





wu 
e aa 
t.,-~6@) 0 <v=H) 
+ This terminology is contrary to (3), where even graphs, by definition, 


contain no loops. 
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imply that n is even. It is well known} that I is even if and only if 
all nodes can be divided into two classes A, B in such a way that no 
two distinct nodes of A and no two distinct nodes of B are joined by an 
edge. We call A, B a parity classification of T. 

Let A be a set of nodes of [. We say that T satisfies the condition 
H(A) if, given any independent finite sub-set S of A, consisting, say, 
of k nodes, there exist / distinct nodes each of which is joined by an 
edge with at least one node of S. 

The sufficiency of condition H in Theorem I is a consequence of 
the following theoreni. 

THEOREM II. Suppose that A, B is a parity classification of the locally 
finite generalized graph 1, and that T satisfies the conditions H(A) and 
H(B). Then T possesses a factor. 

4. Suppose that Theorem IT has already been proved for graphs 
with at most denumerably many nodes. Let T° be any generalized 
graph satisfying H(A) and H(B), where A, B is a parity classification 
of T. Let the graphs [™ (A € L) be the connected components of I, 
and A®), B® be the intersections of A and B respectively with the 
set of nodes of M. Thent [™ has at most denumerably many nodes. 
Furthermore, A®, B® is a parity classification of T™, and I is 
locally finite and satisfies H(A) and H(B”). Hence, by assumption, 
I possesses a factor F”. Then the union of all F is a factor of I. 
Hence the general case of Theorem IT has been reduced to the case 
of at most denumerably many nodes. 

5. Suppose that Theorem IT has already been proved for graphs 
with a finite number of nodes. Let the nodes of T° be ¢,, Cg, ¢3,.... Let 
A, B be a parity classification of [. Define, for any positive integer 
m, the generalized graph I, as follows. The nodes of [,, are ¢,, €9,..., Cp: 
The edges of [},, are all pairs (c,,c,) satisfying either (i) 1 < r < m; 
l<s<m;c,=c,inT or (ii) l<r=s<m;c,=c,inT for at 
least one t > m. Put§ 

Ag, = MlO,, C5000; Cut; Bo = Bhe;, Cg,.005 Cg} 

+ (3), 151, Satz 12. The fact that T may contain loops does not affect the 
validity of the argument. ‘ 

t (3), 79, Satz 2. The fact that T may contain loops does not affect the 
validity of the argument. 

§ Brackets { } are used exclusively for the purpose of explicit definition of 
sets by means of a list of their elements. The meet (intersection) and union 
‘(sum-set) of sets U and V is denoted by UV and U-+-V respectively. If U is 
finite, then |U| denotes the number of elements of U. 

3695.20 H 


m 
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Then A,,, B,, is a parity classification of [,. Also, [, satisfies 
H(A,,) and H(B,,). For let S be a finite sub-set of A,, independent 
in [},. Then S is independent in [, and Sc A. Let S consist of the 
k distinct nodes 2,, 2 ,..., #,. Then, by H(A), there are k distinct 
nodes ¥;,..., Y, satisfying 

x, =y,mYr (l<«<¢h), 


where the p, are suitable indices in the range 1 < p, << k. Then 





x, = y, in T,, since otherwise, by definition of edges in [,,, x, = Xp, 
in [,, which contradicts the fact that S is independent in [,. This 


proves that [,, satisfies H(A,,) and, by symmetry, H(B,,). By our 
assumption Theorem II is already established for [,. Hence [, 
possesses a factor F,,. This means that, given m and any integer A 
in the range 1 < A < m, there is a unique edge y,,, of F,, which has 


the property that, in [,,, c, is incident with y,,,. Since [ is locally 


m? 
finite, there are only a finite number of distinct edges among 
Yu VAstA> VYAtzd+- Hence, by Cantor’s diagonal process, there 
is a sequence of edges yf, y3,... which has the property: 


given any positive integer /, there are infinitely many numbers m 


satisfying 3 
: : Ymar = VY, (1 — A >= l). 


IT shall show that F* = {y7, y¥,...} is a factor of [. For every fixed A, 
y; is an edge of infinitely many graphs [[,,, and therefore an edge of I’. 
The node c, is incident with every edge y,,, (m > A) and therefore 
incident with yf. Finally, if c, is incident with y/, then one can find 
an index m > max(A,u) which has the property that y,,, = yy; 
Ymu = Yp- Then c is incident with both y,,, and y,,,,, and therefore, 
in view of the definition of F,,, A = ». Thus F* is a factor of I. 

6. It has been shown that, in proving Theorem II, we may assume 
that T is finite, |A| = m; |B| = n, 

A = {@,,Gq,...,4,}; 
In the following two lemmas it is supposed that A, B is a parity 
classification of [. The letter » always denotes integers in the range 
1 <p <™, and » integers in the range 1 <vy <n. A node « of [ 
is called singular if « = x, regular if « Aw. If U is a set of regular 
nodes, then J(U) denotes the set of all nodes joined by an edge with 
at least one node of U. We write 
J (25, Xq,..-) Zz) 

instead of J ({x,, %g,..., %,}). 
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7. Lemma 1. m <n, and suppose that all b, are regular and 
that 1 satisfies H(B). Then 1 has a factor. 
Proof. By H(B) the definition of A and B, 
< |J(b,,...,5,)| < m. 
Hence m = n. Again, by H(B), 
J (b,,)+---+I(b,,)| > k 
whenever 1 <k <n; v,< 19 <... < vy. Hence, by Hall’s theorem 


(2), there are n distinct nodes a,,€ J(b,). Then 


{(a uy? 9); (a us Oa) +++3 (Buy On)} 
is a factor of T. 


8. A set S of nodes is called critical if S consists of regular nodes 
of A and satisfies |J(S)| = |S}. 


Lemma 2. If H(A) holds, and S and T are critical sets, then S+-T 
is a critical set. 
Proof. We may assume that 
S = {a,,dg,...,A5}; T= {Oy y0005 Ups Uy py 5005 Ups 
where 0 <r<s<t<~m. Then, using the definition of critical sets 
and the conten H(A), we find that 


S|+|2| = |J(S)|+|1(L)| = |J(8)+-J(7)|+|J(S)I(T)| 
> |I(ay,.-4)|+|I(@,,---54,)| > tr = |S|+|7. 


Hence, in particular, |./(q,,...,@,)| = t, which is the desired result. 


9. We can now complete the proof of Theorem II. We suppose 
that both H(A) and H(B) hold, and furthermore, without loss of 
generality, that 0 << m <n. Let o be the number of singular nodes 
of fT. Ifo = 0, then, by Lemma 1, [ possesses a factor. We assume 
that o > 0, and we use induction with respect to co. 

For any node x, denote by I'(a) the graph obtained from IT’ by 
omitting the node x and all edges incident with «. The graph I'(6,) 
is even, a parity classification being A, B,, where B, = B—{b,}. 

Case 1. Suppose that there exists a singular node b, which has the 
property that the graph I’ = ['(b,) satisfies the condition H(A). 
Clearly, I’ satisfies H(B,). The number of singular nodes of I” is 
o—l. Hence, 7 the induction hypothesis, I’ possesses a factor F’. 
Then F’+-{(b,, b,)} is a factor of I. 
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Case 2. Suppose that for no singular node 6b, the graph I(b,) 
satisfies the condition H(A). We may assume the nodes by, bg,..., 6, 
to be singular and the nodes b_,,,...,b, regular, where 0 < 7 < n. 
If 7 = 0, then, by Lemma 1, [ possesses a factor. Let 7 > 1. Choose 
any v <r. Since [ satisfies H(A) and I'(b,) does not satisfy H(A), 
it follows that thete is a critical set S, satisfying b, ¢ J(S,). Then, by 


> a 9? » ge 
Lemma 2, the set S* —S,4...48 
is critical. Also, 
{b,,...,b,} c J(S*) = T* (say). (1) 
By definition of critical sets, |S* T’*|. By numbering the nodes 
(y,...,4,, Suitably, one can assume that 
Ye f ’ ‘ 
S* = {a,,...,@}. (2) 
By numbering the nodes b_,,, b,,»,...,0,, suitably, one can assume that 
Te __ ’ 
T* = {b,,..., 0,3. (3) 
By (1, l<r<l<macn. 
The nodes @,, @,...,%, 0), 0).9,.-.,0, are regular. Let 
Pa < Po < oe << py < t. 
Then, by H(A), |J (a,.,)+ +...+ J (a,,)| sa 4 


Hence, by (2), there are / distinct nodes 

b,,ES(q) (L<A</)). 
By (1), (2), (3), we have vy, <1. Hence 1, v9,..., v, is a permutation 
oft. 2.4. 


By H(B), |J (bp 415---)5,)| > n—l. 
On the other hand, by (1)-(3), 
I (Bp.4992-2 Og) C {y4350005 Beh, 
and therefore |J (Dp41---)5,)| < m—l < n—l. 


Hence, m =n. Letl+1<y<¥,<...< ». Then, by H(B), 
|J(b,,)-+.-+I(b,,)| > &. 

Therefore, by (2), there are n—/ distinct nodes 

a,,EdS(b)) (I41<A<n). 
By (1)-(3), wy Sl+1. Hence py,.;, py+95---, Hy, IS a@ permutation of 
14-1, 14+-2,..., n, and we conclude that 

{(41, D,,), (Gay Dy) sees (Ap By,)s (Qparz ys Orga) s+++9 (Gags On) 

is a factor of [. This proves Theorem II and therefore Theorem I. 
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10. I am indebted to a referee for the remark that if Theorem II 
is known to be true for graphs without loops, it automatically follows for 
graphs with loops, by means of the following device. Let T be a locally 
finite generalized graph, and let A, B be a parity classification of I. 
Suppose that T satisfies H(A) and H(B). Define a graph I’ as follows. 
The nodes of I” are all the nodes of [' and, in addition, sequences 
of infinitely many nodes 2, 2»,..., one such sequence corresponding to 
every singular node x of I’. All additional nodes are different from 
each other and from the nodes of [. As edges of I’ we take all those 
edges of [ which are not loops and, in addition, corresponding to every 
singular node z of I’, infinitely many edges 

£=%24,=2,=.... 
> 
Put A’ = A+ } {x5 %4,2%¢;...}+- > {x1,2,...}, 
reA reB 
B’ = B+ > {2,,2,,...}+ > {,,2g5.--}; 
xreB reA 


where x ranges through all singular nodes of [. It is easy to see 
that (i) I’ is a graph without loops, (ii) A’, B’ is a parity classification 
of I’, (iti) I’ satisfies H(A’) and H(B’). Ifthe conclusion of Theorem II 
holds for I’, then [’ possesses a factor F’. Then I possesses the 
factor F obtained from F’ by omitting all edges (x,,x,,,) in F’ and 
replacing every edge (x,2,) in F’ by the corresponding loop (7,2). 
Unfortunately this remark, as far as I have been able to see, cannot 
be used to simplify the proof, since the graphs I, in § 5 may contain 
loops, even in cases where I’ contains no loops. 


11. I shall now re-state Theorem IT in terms of general set theory. 
In this formulation the exceptional part played by loops is abolished. 
Let P be an abstract set, the index set. The letter v denotes a typical 
element of P, and the letter N denotes a typical finite sub-set of P. 
Let Q be a set, and Q be a system of sub-sets of Q, 

Q: <A, (ve P). (4) 
A sub-set P’ of P is called discrete in Q iff 

A, A,, = 9 (v, Avg; », € P’; v,€ P’). 
The system Q is called separable if P = P,+P,, where both P, and 
P, are discrete in Q. The system Q is restricted if, corresponding to 
every v, there is N, such that 
A,A,=96 (v' EN). 
+ @ denotes the empty set. 
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A set R is called a representative set of Q iff 


Rc A,; |A,R\| =—1 (ve P). 


Put, for P’ c P, oP’) = > A,, 
ve P’ 
THEOREM III. A separable, restricted system Q possesses a repre- 
sentative set if and only if 
QUIN) ¢ Q(N’) 


whenever N is discrete in Q and |N| > |N’|. 


Condition (5) is, in fact, necessary for every 2, whether separable 
and restricted or not. For, if R is a representative set of Q, and N 
is discrete in Q, |N| > |N’|, then 

|RQN)| = |N| > |N’| > |RQWN’)|. 


Hence (5) follows. This condition is, however, no longer sufficient 
if Q is restricted but not separable, nor if Q is separable but not 
restricted. This is seen by means of the following two examples of 
systems neither of which possesses a representative set: 


‘ $1 DS 2i JD Qi 
Q,: {1,2}, {1, 3}, {2, 3}, 
° f Q 5 r Dr’. fOr Se ’ f4rt SR RI 
Q,: {1,3,5,...}, {1, 2}, {2}, {3, 4}, {4}, {5, 6},.... 


These examples are connected with the special graphs [), [, defined 
in § 2 by a process which I now want to describe generally. 

Let Q be any system (4) satisfying (5). Define a generalized graph 
[ = ¢(Q) as follows. The nodes of I are the elements v of P. The 
edges of [ are all pairs (v,,v,) satisfying either 


(i) Vy # Vo; 4,,4,, a 0 
(ii) vy = v93 A, ¢ > A,. 


Now suppose, in particular, that Q is separable and restricted. 
Then it follows at once from the definition of [ that [ is even and 
locally finite. Let A, B be a parity classification of [. Then (5) shows 
that [ satisfies the conditions H(A) and H(B). Hence, by Theorem IT, 


+t A less formal interpretation of this definition is that a representative set 
is a ‘Joint Committee’ of societies A,, on which every society has exactly one 
representative. If the membership lists of the societies show any arbitrary 
degree of overlapping, a ‘Joint Committee’ of this type need not exist. 
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there exists a factor F of [. Corresponding to every edge y = (v4, v2) 
of F, one can select an element a(y) of A in such a way that 
a(y) € 7 of Vy - V25 
aly)e A, —A,, > A, tf vy = ve. 
vFYU 


Then > {a(y)} is a representative set of Q. Thus Theorem II implies 
yer 


Theorem ITI. 

It is equally easy to show that Theorem III implies Theorem II. 
Let [ be any generalized graph. Define a system Q = ¢(T) as follows. 
The index set P of Q is the set of nodes of [. If v ¢ P, then A, is the 
set of all edges incident with v. If, now, I is even and locally finite, 
then %(I°) is separable and restricted, and if 4([) possesses a repre- 
sentative set, then I’ possesses a factor. 


12. In proving Theorem II, we applied Hall’s Theorem (2) to 
special finite systems of sets. In conclusion of this note, I want to 
show that, quite generally, the possibility of selecting distinct repre- 
sentatives, in the sense of Hall’s Theorem, of any system Q of sets is 
equivalent to the possibility of selecting a representative set, in the 
sense of Theorem III, of some special separable system Q*. 

Consider an arbitrary system of sets 


Q: A, (ve P). 


There is no loss of generality in assuming that P > A, = 6. A func- 


v 


tion f(v), defined for every v € P, is called an H-function of Q, if 
fv)eA, Wve P), 
f(r) FfW2) (vy # v2). 


Let « denote a typical element of the set 


Q= 2 A,. 
Put g(v) a {(a, v)} (ve P), 
h(a) = {a} +2 {(a,v)} (ae Q). 


In these formulae, (a, v) denotes ordered pairs consisting of an element 
of Q and an element of P. Consider the system of sets 


Q*: g(v) (ve P), 
h(a) (aeQ). 
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The index set of Q* is P+@Q. The following propositions are easily 
deduced from the definition of Q*. 
(i) Q* is separable; 
(ii) of f(v) (ve P) is an ae of Q, then 
> (FO). }+(2— EF )}) 
is a representative set of Q*; 
(iii) of R* is a representative set of Q*, then 
R*g(v) = (f*(v),v) (ve P), 
where f *(v) is an H-function of Q. 
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DIFFERENTIAL EQUATIONS WITH 
POLYNOMIAL SOLUTIONS 


By T. W. CHAUNDY (Ozford) 
[Received 23 June 1947; in revised form 31 January 1948] 


1. Introduction 
W. W. SawyeEr* has considered the problem of finding systems of 
linear ordinary differential equations 
(F—AG)Y = 0 (1) 
(where A is a disposable constant) such that, when A= A,, the 
equation is satisfied by a polynomial of degree n and this for every 
positive integer n as well as for n = 0. Sawyer has discussed only 
systems of the second order, but, although the chief practical interest 
may well lie here, there is no need in theory for the limitation. Thus 
it is a commonplace of hypergeometric theory that, when 6 denotes 
«D, n is a positive integer or zero, and f, g are polynomials with 
constant coefficients (but subject to certain precautions on their 
zeros), the equation 
Sf(8)Y = x(5—n)g(5)¥ (2) 
has a polynomial solution of order ». Writing A for n in (2) we 
evidently have a system (1) with the desired property, and it was 
from this point that Sawyer took up the problem: to investigate 
whether similar systems existed outside the hypergeometric range. 
I approach the problem by a method somewhat different from 
Sawyer’s, but in either way it is soon seen that systems such as (2) 
give a particular, not a general, answer. In what follows I hope to 
answer the question of the general form of (1), with some convenient 
interpretation of the word ‘general’; and I shall outline the solution 
of the second-order equation since this is more particularly Sawyer’s 
problem. 


2. Preliminary clarification 
2.1. I follow Sawyer in avoiding the possibilities A= 0, «0 by 
replacing A, if necessary, by (A—h’)/(A—h) where h, h’ are two (real) 
numbers absent from the enumerable set (A,,). This replaces (1) by 
(hF—h’G)Y = A(F—G)Y, 
which is still of the same form. 
* See above pp. 22-30. 


{Quart. Journ. of Math. (Oxford), Vol. 20, June 1949, pp. 105-20] 
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2.2. The equation (1) written in extenso will have some form 
(a) D?+...4a,)¥ = A(B)D?+...+-B,)Y, 

where ap, Bo,.-., x,,, 8, are 2p 2 functions of the independent variable x. 
To determine their ratios we can substitute into the equation 2p+ 1 
different polynomial solutions together with the appropriate values 
of A. 

Now the form (1) is not unique since, if © is an arbitrary linear 
operator, the equation 

O(F—AG)Y = 0 
is satisfied by all the solutions, including the polynomial solutions, 
of (1). We may therefore find that the 2p }+-1 equations to determine 
the ratios of «, 8 do not do so uniquely. It may even happen that 
the substitution of all sets of polynomial solutions still leaves the 
x, 8, ambiguous. In this case the polynomials are common solutions 
to (say) 
(F,—AG,)Y = 0, (F,—AG,)Y = 0, 

The operators F.—AG, will have a common end-factor F—AG which 
we sufficiently take as defining the equation (1). Since all the solutions 
that we have used have been polynomials, the coefficients in F,, G,, 
and therefore in F, G, are sufficiently polynomials. 

Changing the fundamental operator from D to 6 (which still leaves 
the coefficients polynomial in a) and arranging the equation in powers 
of x, we sufficiently consider differential equations of the form 


F(x,6)¥ = AG (x, d)Y, (3) 
“Me & 2. e . 
where F(x,5) = > a’f,(8), G(x,8) = > 2’9,(5), 
r=q r=q' 
and p >q, p’ >q’. In particular we consider the equations 
5h : ' — = 
[ > 27-(8)|¥, = A,| > 2°9-(6)]Y,,, (4) 
fr=gq r=q’' 
where Y,, is a polynomial of some form 
a 2" On n—1 xn ; +++ Ong: (5) 


As Sawyer has already emphasized, by a polynomial ‘of degree n’ 
we must be understood to mean ‘of degree exactly n’ and not merely 
‘of degree not exceeding n’: otherwise Y = a constant could be a 
sufficient solution for every A,. Thus the term 2” is always present 
3) 

In (4), when the operations 5 have been performed, we are left on 
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each side with a polynomial, which we suppose arranged in powers 
of x. The leading terms on the left and right are then respectively 


fp(n)a"*”, A, fy(n)x” +p" 


Thus, unless p = p’, one or other of these would be an isolated term 
in (4), and we should need 


f,(m) =9 or f,(n)=0 
for all n, which is impossible. Hence p’ = p. 

At the other end of the summation we can ensure, by writing 1/A 
for A if necessary, that G contains as low a power of x as F and, in 
fact, dividing through by this power we can take gq’ = 0. We can 
further take q = 0 by admitting, if necessary, f,(5), f,(8),... identically 
vanishing. It is thus sufficient to consider (4) in the form 


P. ™ 2. * 
[ > 2°7.(8)|¥, = au[ > 2°9,(8)]¥,,, (6) 
r=0 r=0 
where expressly Jo(5) is not identically zero. (7) 


2.3. By considering the series of polynomials Y,, we can successively 


choose constants €,, ,,_1,---)€,9 Such that 
= Yi+Cnn—1Yn-1t---+eno Yo: (8) 
and, of course, 1 = Y,. Thus, in virtue of the defining identity (6), 
F(x, 8)a" = G(x, 8)Z,,, (9) 
. 4 F aS a 7 | 
where Z,, An Yn t+An—1Cnn-1Yn-1t-+-+AoCno- 
Written out we may suppose that 
Y, - Y yn—1_} | 
ZH yg B® 48g 1 "+... +O ng: (10) 


2.4. Conversely, to obtain the Y,, from the Z,, we need to determine 
constants a,, such that simultaneously 
Y, = x" +n n-1 a+... +Gng, 
An A he Zyt+nn-1 Zyrat + Tn Zo. 
For then, by (9), 
F(x, 5)Y,, mas G(x, 8)(Zn+Onn-1 Z.-1+ Ang Zo) 
= A, G(a, 5)Y,. 
The a,, have to satisfy the identity in 2, 
An ("+En nr "1+... +@ng) 
x (A, x” +O,4-1 iii + os) AEnn—1(An—a ™—1+- bya not" “8+...)+ 
+4yn-2(An—2 a-24-b, 94-30" + we) + ore 
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Thus, on comparing coefficients, we need 
(A 
(A 


—A,-1 8a» -1 = Man -1 


n 
n—Ag-ainend Onna ned bn -1n—29 

We can consequently determine the a,, in succession without infinities 
so long as the X,, are all different. With this condition satisfied, the 
defining identity (6) can sufficiently be replaced by the derived identity 
(9) for all n, and I shall appeal to one or the other as may be more 


convenient. 


3. A sufficient form 
3.1. One can now, without more ado, write down a sufficient form 
of (1). I give it in a theorem: 
THEOREM A. The system of equations 
p 
[ > 2?-78(8—1)...(8—r-+ Ih, _,(8)| Y 
= 
Pp : = 
“a [ > 2 5(8— 1)...8—r-+ 1)jp-+(8)]3 (11) 
r=0 
(where the h, j denote polynomials with constant coefficients) have 
polynomial solutions of every degree with 
A, — h,(n)/j,(”), 
provided that these X,, (n = 0, 1,...) are all different. 
This is soon seen. For in (11) write A = A,, substitute 
Y,, = B® 4G y 9-12" 1+... + Ong 
and suppose the operations 6 performed. Then the operator 
8(5—1)...(6—r+1) 
appearing in the coefficient of x?-" on either side removes from Y,, 
powers of a below 2”, and so no term is left in (11) below x?. The 
highest power is «”+”, and thus comparison of coefficients gives 
n-+-1 equations to determine the n-+-1 constants A,,, Gp ny.) Ano 
These equations are 
0 =h,(n)—A, j,(n), 
0= An n—1hp(m ——§ )—A, jp(n a. 1); +nth,1(n)—A,, Jom}, 
O = Ay n-2{h,(n—2)—A, jyp(n—2)}+-... 
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The first gives A,, = h,(n)/j,,(m), and from the others we can determine 
. in succession (without infinities) provided that none of 


h,(r)—A, jy(r) (r = n—1,n—2,...,0) 


is zero; and this is covered by the proviso of the theorem that the A,, 
are all distinct. 


a n,n—-1 Bn n—2** 


3.2. One naturally inquires how general is the form (11). Writing 
it shortly as [H(«,8)—AJ(«,8)]¥ = 0, (11a) 
we can rewrite it as 

[H(x,5—m)—AJ(x,5—m) |x"Y = 0, 
and consequently, when m is a positive integer, the system 
[H(x,8—m)—AJ(x,5—m)]¥ = 0 (12) 
has polynomial solutions Y,,,,, = 2”Y,, of all orders m and upwards, 
with the same set of values (A,,) of A. If on such a system (12) we 
impose the further finite set of conditions necessary for it to have 
polynomial solutions of degrees 0, 1,..., m—1, then we again have a 


system having polynomial solutions of every degree. 
One such system is 


[H(x,3—m)—AJ (x, 8—m) |8(5—1)...(8—m+1)¥ = 0, (13) 


for it is still of the form (12), and the operator 5(8—1)...(6—m-+1) 
annihilates all polynomials of degree less than m. More generally I 
consider a system in which the missing polynomial solutions are just 
the simple powers of «, i.e. x" (n = 0,...,m—1). In this case we find 
that we can write F’, G by interpolation in the forms 
p => e 
F(x,8) = > w~{B6)+ 58 1)...(8—m—r-+1), (14) 
ii s=0 
m—1 e 


G(x,8) = § 2-+{Q,8)+ b | 5, }8—1)..8—m—r +1), (15) 
va s=0 





where the e,, are a set of arbitrary constants and P,, Q, are arbitrary 
polynomials with constant coefficients: these latter contribute a 
‘complementary function’ of the form of (13). Here the coefficient 
of x?-" in each operator is of the form (8—m)...(8—m—r-+1) multi- 
plied by an operator integral in 8. Thus (14), (15) give 


F(x,8)—AG(a, 8) (16) 

















110 T. W. CHAUNDY 
of the required form 
H(x,5—m)—dAJ(x,5—m). 
In operation on x” (n < m) the operator (16) reduces simply to 


$ xP-"(r, —Ale 8 (8—n-+ 1)(5—n—1)...(8—m—r-+]), 
7=@ 


rm Ser 
since the other terms of this operator annihilate x”. Thus 
7 i, 
F(x,8)—A,, G(x, 8) 
annihilates 2”, and the set of polynomial solutions is complete. 


3.3 This system looks somewhat artificial against the system 
defined in Theorem A. We ought perhaps to think of it as the 
‘derived’ or ‘secondary’ type, with (11) as the ‘fundamental’ or 
‘primary’ type. I shall distinguish as the types o/, %,, respectively 
those defined by (11) and by (14), (15). It is characteristic of %,, that 
its polynomial solutions up to degree m are the simple powers x” 
(and we may include x” itself). In the limiting case we could include 
,, in which every polynomial solution is the simple power 2”; and 
at the other extreme .97 is, in effect, A. 

The differential equations (11), (14) are both of ‘rank’ p, in the 
sense that they contain p-+1 powers of x and so p distinct steps 
between these powers. In (11) the order of the equation is at least 
p since the coefficient of the absolute term (the term independent 
of x) contains the factor 6(8—1)...(s—p-+1). In (14) the order of the 
absolute term and therefore of the equation itself is at least m+p—1. 

Conversely, then, with equations of given order n, the rank of 
types W, A, is at most n; the rank of any type 7, is at most n—m-+ 1. 
Thus the second-order equation of these types is not necessarily 
hypergeometric but may be of rank 2. The first-order equation is at 


most hypergeometric. 
3.4. As a simple illustration consider the possible first-order 
equations. Type ~ then gives most generally 
[w(ad’—b)—cd|¥ = Ala(a’5—b’)—c'S]Y. (17) 


This we can write SY = [(6—ByY, (18) 


Z 


c—Ac’ b—Ab’ 
= eee B=- - 
i—i” a—)a’ 





where A 
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js : an—b 
Now, 17), A, = > 19 
with the condition ab’ + a’b (or else X,, is independent of n). Then 


(ca’—c'a)n+ (be’—b'c) 


A, = —— > a cee B, = a. 
Thus, from (18), Y,, = (x#—A,)". (20) 
We can impose a linear transformation on x to reduce this form to 
one of Y, = 2* (ca’ = c’a), (21) 
Y, = (X+n)" (ca’ <c’a). (22) 


The type &%, is quickest dealt with by writing 5—1 for 6 in (17). 
This multiplies the solution Y by 2 and replaces n in (19) by n—1. 
We therefore have B, = n—1, and so 

Y, = x(2+A4,)". (23) 
We have still to impose the condition that Y, = 1 is a possible solu- 
tion. This gives 

a+b = d,(a’+0’), c=A,c’,, 

and so Ay = 0, as is at once evident from (23). A suitable linear 
change on a reduces this to 

Y, = X(X+n)"-. (24) 
These forms (21), (22), (24) are the three that Sawyer has given as 
essentially the only polynomials that can arise from equations of the 
first order.* 

So far I have proved only that the forms o~/, <A, are possible 
systems of Sawyer’s type. I now hope to show that every system 
belongs to one of these types or is ‘reducible’ (in a reasonable sense) 


to one of them. 


4. Generality of the forms 7, 7, 

4.1. It is characteristic of the general form .%~/, i.e. when A is not 
restricted to one of its special values A,,, that it has in general p 
distinct solutions in series 


a, x". (25) 


Ms 


Y = 


rT 


For, as in § 3.1, if we substitute ( 


li 


no oe 


¢ 


5) in (11) and operate, x” is the 


* Loc. cit. 30 (19). 
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lowest power of x that emerges. Thus the equations to determine a, 
are of the form 


0 = ag{h,,(0)—Aj,(0)}-+a,fh,,-4(1)—Aj,,-1(1)} +--+ 
+p! Ay{ho( P) —Ajo(p)}, 

0 = a,{h,(1)—Aj,(1)}+ 2afh,, o(2)—Aj,-2(2)} +... + 
+(p+ 1)! a, .,{ho(pt+1)—Ajo(p+1)}, 


We can therefore choose dp,..., @,, with p degrees of freedom, and the 
remaining @,, ,;,... are then determined uniquely in succession (barring 
infinities). This gives us p independent solutions in series. The 
separate equations 

F(x,3)Y = 0, G(x,5)Y = 0 
also have this characteristic property, of course. 

Presumably these series converge in some interval |#| < p, but 
this question of convergence does not affect the argument, which, 
in essence, is concerned only with the partial sums in these series. 

4.2. Suppose now that in a system (F—AG)Y = 0 enjoying 
Sawyer’s property, which I suppose written in the form (6), the 
equation GY = 0 has just r solutions S,,..., S. of the form (25): or 
more precisely that 

g(t) = 0 (¢= 0,...,.r—s—1; 8s = 0,...,r—1), (26) 
where, for the moment, I impose no restrictions on r. These con- 
ditions ensure that, in the defining identity 


F(a, 8)Y,, = A, G(x, 8)Y,, 
after differentiation no term below x” survives on the right. The 


same must therefore hold on the left, and so the conditions (26) 
necessitate the similar conditions 

f(t) = 90 (¢=0,...,r—s—1; 8s = 0,...,r—1). (26a) 
Thus, when r = p, we get the full conditions to define ». When 
r > p, the conditions (26), (26a) include the conditions defining .~/, 
and we therefore obtain a particular case of o. Accordingly the 
conditions (26), when r >, merely recover the type &: new 
possibilities arise only from r < p. 

We can write the derived identity 


F(x,8)a" = G(x,8)Z, (n>7) (27) 














DIFFERENTIAL EQUATIONS WITH POLYNOMIAL SOLUTIONS 113 
in the form G(x, 5)Z,, = 0+ O(a"), 


since no power below x” occurs on the left. Thus, correct to x"-", 
Z,, is a series solution of G(x,8) and we can write, for some c, 


Zn = Cy Sy +... +9, S,4+ O(x). 
Similarly Zn = Cy S,4+...+¢,, 8,4 O(a"). 
Proceeding in this way down to 
Zy—r = CS +... +p, S,4+ O(a"), 
we can then eliminate the S,,..., 8, between the r-+1 equations for 
} ene Z This gives an identity of the form 


Bon Z,, + see + | akan = O(z**). 


The highest term on the left is 2” from Z,,, and so we can write more 


n? 


n-rT* 


explicitly 


Bon 2Znt---+ Brn Zn—p = Aon t"+---tApn 2”. (28) 
Operation with G(x,8) gives, in virtue of the equations (27), 
F(x,8)(Bo, 2"+...+ B,, 2") = G(x, 8)(Ao, x" +...+Apn 2”). (29) 
The coefficients A, B involve n rationally and, by clearing of fractions 
in » if necessary, we can make them polynomial functions of n. 
lo emphasize this I rewrite (29) in the form 
F(x,8){B,(n)+...+27B,(n)}a" = G(x,8){Ag(n)+...+-07A,(n)}a”. 
(30) 
Negative powers of x will not, of course, occur in this identity and 
to exclude them the functions A, B must be such that 
A,(t)=0, B(t)=0 (t<s). (31) 
We can rewrite (30) in the form 
F(x,8)B(x,8)a" = G(x, 8)A(x, d)a”, 


where A(x,8) = Ao(3)-+...+2-7A,(8), 
B(x, 5) = Bo(d)+...+a-7B,(8). 
Then the differential equation 
[ F(a, 5) B(x, 8)— G(a, 8)A(x,5)|¥ = 0 
has the infinity of linearly distinct solutions 
Y=2" (*h=f,r+1....), 
and so we have, identically in 2, 6, 


F(x,5)B(a,8) = G(a,5)A(z, 8). (32) 


3695.20 I 
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4.3. Consider the system of differential equations 
x"| A(x, 5)—AB(x,5)|y = 0. (33) 
The conditions (31) are analogous to the conditions (26). They show 
that A,(5), B,.(5) are divisible by 5(8—1)...(8—r-+1) and so on: in 
fact that (33) is of type %, with r replacing p. It has polynomial 
solutions y, of every degree n with A,, = A,(n)/B,(n). 
Now from the identity (32) we have 
[ F(x, 5)—A,, G(a, 5)| B(x, 8)y,, = G(x, 8)[ A(x, 8)—A,, B(x, 8)]y, = 0. 
(34) 
Thus Y,, = B(x, d)y,, (35) 
satisfies the equation 
[ F(x, 5)—A,, G(x, 8)|¥,, = 0. 
Further Y, is a polynomial, being a sum of polynomials B(x, 8)a* 
(¢ = 0, 1,...,”), and is precisely of degree n, the term of highest order 
coming from B(x, 5)", which is B,(n)x”. Thus 
[ F(x,8)—AG(a,5)|Y = 0 (36) 


has polynomial solutions of every degree, obtainable in the form 
(35) from the polynomial solutions of (33). It seems not unreasonable 


to regard the system (36) as replaceable by the system (33). When 
r <p, the system (33) is of lower rank than (36), and I shall say 
that (36) is reducible to (33). Thus, when the conditions (26) are satis- 
fied with r < p (that is the conditions are insufficient to give (36) the 
form ./) it is reducible to a system of this form of lower rank. 


4.4. Alternatively to (26) we might have 
g(t-+m) (¢ = 0,....r—s—1; 8 = 0,...,r—1) (37) 
for some positive integers m, r, so that the equation G(x,6)Y = 0 
has 7 solutions in power-series beginning with x”. The foregoing 
arguments show that 
[ F(a,8—m)—AG(x,5—m)|Y = 0 
is either of type ~ or reducible to 
[A(x,8—m)—AB(x,5—m)|¥ = 0 
of type # and lower rank. In this latter case 
[A(x,8)—AB(x,8)|Y = 0 


has polynomial solutions of every degree from m onwards. 











DIFFERENTIAL EQUATIONS WITH POLYNOMIAL SOLUTIONS 115 


To supply (38) with the missing solutions of degrees less than m 
we must re-examine the arguments of § 4.2 more closely under the 
conditions (37). We now have 


g(t) AO (t<m). 


Thus, if Z, had a last term 2"-*, this would not be annihilated by 
go(5) and would survive as an isolated term in the identity. Hence 
we must have Z,=A.x” (n<m). (39) 
This shows further that 

Y,=2" (n<™m). 

The series S,,...,S,, since their leading term is 2”, do not enter 
with these values of n, and the identities (28) are simply (39) when 
nm <m. Thus, when n < m, 

5, = I, A, = A; B., = 0, A,, =9, (¢= I.,...,¥), 
so that Biz,n)=1 (n<m). 

Then, in (35), Y,, = 2” gives also y, = x", and (38) has the poly- 
nomial solutions 

¥,=2* (n= 0,1,...,.m—1). 
This completes the solutions of (38) and shows it to be exactly of 
type ~,. Thus under the conditions (37) the system is of type .%,, 
or reducible to a system of this type of lower rank according as 
r>porr< Pp. 

We should note that the type .~%, can be obtained simply enough 
by choosing the constants in .~ so that the differential equation has 
the additional solutions z-",...,a-” and then writing 5—m for 8. In 
the polynomial solutions Y,, themselves we make the same choice of 
constants and then write 

Y,, = 2*Y,_,.- 

4.5, There remains the case when g,(¢) = 0 has no positive integer 
or zero solution, and so G(x,8)Y = 0 has no solutions in power-series. 
In effect we are taking the limit m — 00 of the case just discussed in 
§ 4.4. By the arguments of that paragraph it is clear that the poly- 
nomial solutions are all single powers, and the defining identity is just 


F(x, 3)a" = dX, G(x, d)x". 
This gives, for all n and every r, 


f.(n) _ A, g,(n), 
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and the system now reduces to 
[ fp(8)—Ag,,(8) |¥ = 0 (40) 

of zero rank (i.e. the equation is homogeneous). The identity (32) 
now becomes , ' , 

F(x, 8)g,(8) = G(x, 8)f,(8), 
or indeed any of F(x, d)g,(5) = G(x, d)f,(8), 
which suggests that, in general, the A, B of (32) may not be unique. 

If the polynomial solutions are our only interest we can, of course, 

replace (40) by the first-order equation 

(6—A’)¥ = 0 (41) 
with X}, = n. I do not know whether this reduction in the order of 
the equation by seeking only the polynomial solutions is possible in 
a more general case. 

At any rate the argument is now complete, and it has been seen 
that every system of Sawyer’s type is either o or %, or reducible 
to one of them. 

5. Mixed forms in D, 6 

5.1. We can put the type 9° into a form that is perhaps significant, 

by using the elementary identity of operators 
8(5—1)...(8—r-+1) = aD. 

For this reduces (11) to 

. ’ ae 

x»| S Drhy-1(8)|¥ dv? | S Drj,,(8)]} 

r=0 r=0 
i.e. simply H(D,5)¥ = AJ(D,8)Y, (43) 
where H, J are polynomials with constant coefficients. These are all 


arbitrary since the polynomials h, j in (11) are themselves arbitrary. 
We note too that 


h,(8), 9,(8) = H(0,8), J(0,8), 


and so the A,, are given in (43) by 


A, = H(0,n)/J(0,n). (44) 

Although the operators D, 6 are not commutative and it may be 

convenient to suppose that D always precedes 5, yet the form (43) is 

still valid for any mixed order of D, 5, e.g. D°5D instead of merely 

D885 or 5D*. Moreover, since Dé—d5D = D, this does not vitiate (44). 
This type includes the hypergeometric type in the form 


[PD fo(8)+fi8)]¥ = ALD go(8)+-91(8)]¥ 
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or, more generally, 
[D°fo(8)+-F(8)]¥ = ALD? go(8)+-9,(8)]¥. 
It includes also equations of the type 
[5fo(D)+fi(D)Y = Al8go(8)+-91(D)|¥ 
soluble by Lagrange’s substitution y = | e"T dt. 
Equations with constant coefficients 
[(D)¥ = Ag(D)Y, 
which do not give polynomial solutions, are excluded since they give 
all A,, the same by the formula 


f(0) = A, 9(0). 
5.2. The substitution (42) effects also some simplification in the 
forms (14), (15) of type %,. We can, in fact, write them in the forms 


m—1 
F(x,8) = - Del fD,8 + > HEAD) (D) 5 =<): 


m—1 


: 1 
G(x,8) = D la (D,8)+ D BAD) = ait 
where the E,(D) are the arbitrary polynomials of order p, 
ED) = ¥ ¢,,D". 
r=0 


6. Systems of the second order 
6.1. The most general .o/ of the second order is of rank 2 and 
gives for polynomial solutions 


[x*{(a—A,, a’ )8*-+- (b—A,, b’)5+-(c—A,, €’)}}— 
—a{(d—d,, d’)5+-(e—A,, e’)}8+-(f—A,, f’)8(6—1)]¥, = 0. (45) 
Here (a—A,,a’)n? LO—-At jn+(c—A, c’) = 0, (46) 
and so the coefficient of x? reduces to 
{(a—A,, a’)(8+-n)+b—A,, b’}(8—n). 
I think of the polynomial solution in the form 
Y,, = #U,(x-), 
and therefore, writing 5+-n for 5 and then 


x =2-1, 3’ = —8, 
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we have the equation for U, in the form 
[3"(8’ +a-+B—1)—a'{p(8'-+a)-+4(8’+B)}(3’—n) + 
+ pqu’2(5’—n)(8’—n+1)]U, = 0, 
ee ee j—A,, i 
rs Daal at” g’’ 


n 


p(a-En)+¢(8-+n) = —— 


a—A,,a’ 


where 


Using (46) and writing 

(a,b) = ab’—a’'b 
and similarly in the other letters, we can express these with explicit n 
in the forms 


_ @,ayn®+ (bn +0) 
= (a, b)n+-(a, c) 
(f,a)n?+(f, 6)n+(f, ¢) 

(a, b)n+(a, c) 
(a, b)n?— (b,c) 
on — 9 ae 52 
a+ Pra (a, b)n+-(a, c) 
- __(¢,a)n?+(e,b)n+(e,¢) 
Pla-+n)-+g(B-+m) m= — ee 
6.2. Now it can be verified without much difficulty that the 
equation (47) has for one solution the terminated Appell’s function* 


U,, = F°[—n; «, B; «+B; px’, qa’, (49) 
and with these parameters the function is reducible to the simpler 
hypergeometric form 


U,, = (1—gqza’)” A — N, 03 ap; 2— ara iF (50) 


 , Saaeelie 








which can also be written 


T (B)n aj abe? * pa’ 
U, = GA), (1—gqa’)” Al n, a; 1—B—n; I, 


so that 


(B)n (a— q)"s EF i] —m 25 1—p—n; P|, (51) 


~ (a+B), 


* Defined, for example, in W. N. Bailey, Generalized Hypergeometric Series 
(Cambridge, 1935), 73 (1); see also 79 (1) and 4, § 1.4(1). I have written F 
for F, to avoid confusion with ,F,, etc. 


o—q 
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Explicitly we can write this polynomial as 
1 —. [n 
Y, =—- (B),—-(x—p)"(a—q)"—. 52 
=o m2, (7) (Bn e—pY(e—@) (52) 


Professor J. L. Burchnall, to whom I am indebted for much helpful 
comment on this paper, has pointed out to me the elegant equivalent 
to (52), 

(x—p)P+"(a—g)e* - 
z, = (—)- D{(c—p)-P(a—q)-}, (52a) 
HB, 
which reduces to Legendre’s polynomial when 
a=Pp=—n, pP.q=Ht1. 

Here p, g, «a, 8, are (irrational) functions of » defined by (48). 
But they could be any functions of n (or constants) without altering 
the polynomial character of (52). 

The equation for Y,, i 

[2%(8—n)(S—a—B—n-+1)—a{ p(8—a—n)+-9(85—B—n)}8-+ 

+pq 5(5—1)]¥, = 0, (53) 
and p, q, a, 8 have to take the forms given by (48) in order that 
(53) may have the form (47) linear in A,,. This suggests that Sawyer’s 
form [F—AG]Y = 0 may be too restrictive and that we should 
perhaps consider more generally 


N 
3A, B(x, 8)¥ = 0. (54) 
r=1 





6.3. It remains to consider briefly the two forms A, ~A, available 
for second-order equations. For .%, the equation (45) has the solution 
x! when A = Ay. Thus 

a—b+c = A,(a’—b’+c’), 
d—e=),(d'-e’), f=A/f’. 
This gives, as is seen from (52), qo, % = 0 (or Po, By = 0). The 
solution is then, from (51), 
= gs necarsainnapntll 

For %, the ictinn (47) has solutions 2-1, «-? when A = A,, Ag. 

Hence 
a—b+c = i,(a’—b’+c’), 4a—2b+-c = A,(4a’—2b’+-c’), 
d—e = i,(d’—e’), 2d—e = A,(2d’—e’) 
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and f=A, Sf’, f=Asf’ 20 that f,f' =— 0. 
Thus pg = 0 (say g = 0) and the rank falls to one. The solution is 


now (8) 
Y, = +A), a", F[2—n, «a; 3—B—n; 1—p/z]. 


For values of A outside the special values 4,, we reinterpret. the 


nt 
solutions Y,, so that » no longer denotes a positive integer. Thus the 
foregoing analysis contains, in essence, the solution of the general 


Sawyer’s system of the second order. 


[Added 6 June 1949. Professor Sawyer, who has seen a proof of 
this article, has pointed out, by an example, the insufficiency of the 
classification of types in §4. As a consequence the second-order 
forms given in § 6 are not complete. This deficiency will be further 
considered. ] 








THE DRAG INTEGRAL IN THE LINEARIZED 
THEORY OF COMPRESSIBLE FLOW 


By M. J. LIGHTHILL (Manchester) 
[Received 3 September 1948] 


In many solutions of problems of compressible fluid flow by use of 
the linearized equation there are singularities in the field which 
have no counterpart in reality, or near which the assumption of small 
perturbations breaks down, and yet the solution gives a good approxi- 
mation to the real flow except near these singularities. In this paper 
a form of the momentum surface integral for the drag is found which 
is correct however near to those singularities the surface is drawn. 
If a body be placed in a uniform stream of velocity (U,0,0) and 
if the local fluid velocity, density, and pressure be 
U(1+6¢/éx, 06/éy, 0p/0z), p, p, 
with p = p, and p = p, in the main stream, the integral in question is 


2 
drag = p,U* { ame{are(2) +(v4)| 24 |as, 
8S 


ra) 
where S is any surface enclosing the body, to which the unit outward 
normal is (n,, N,, N,). 
To prove this, first take S sufficiently far from all singularities for 
the flow to be given accurately by the linearized potential and for 
|\Vd| to be uniformly small on it. By conservation of momentum the 


drag is ad ad 
Ss ' 


on 


which can be written 


od 406 a 
ae +pU2— U?— i d8 3 
[lore aro® las, 
8 

using conservation of mass. For isentropic flow, by Bernoulli’s 
equation, 

od fdp ffi 1 dp 
—u{2 + 4vey| = [2 = | [+ o—po(—4 52) +] aw 

oa p PA “\ pi dp, 

P1 Pi 
= PoP (P+. 0 phi!) (4) 
Pi 2at\ pi Pi 

[Quart. Journ. of Math. (Oxford), Vol. 20, June 1949, pp. 121-3] 
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where a, is the speed of sound in the main stream. Equation (4) is 
still true if there are variations of entropy due to the presence of 
shock waves, since these are of order of the cube of the pressure 
changes at the shock waves. Hence 
moet a —ut = —402(V¢4)?+ 

Pi 

and, since 

P—P1 ~ (P—P) dp,/dp, ~ —p, M* og/ex, where M = U/a, 


wet é = ,(0d\? ; 
p+ pU*—py = (p—p)U* + be, U?(ae( =) —(v9y"] + 01194) 


= — 4, 02(ae(Z) +-(v9)'} + 01109). (6) 





sales) OV") (6) 





This proves (1) under the restrictions mentioned upon S. Now in 
replacing S by any surface surrounding the body let V be the region 
between fs old and new surfaces. Then the difference in the 
integral (1) over the two surfaces, by the divergence theorem, is 


J [eze(trtas) + 7"] te ( sera] a 


= | [are Sot emag & grad d— grad $ grad — — 26] ar, 
v 


Ox Ox" 
(7) 


which vanishes since ¢ satisfies the linearized equation 
M? 6*4/éx* = Vd. 

Hence the result is true in the generality originally stated. It should 
be noticed that it will normally pay to choose S so that on all crucial 
portions of it n, = 0. 
Appendix. The side force 

In the same way an expression for the component of force in, say, 
the y-direction can be found. It is 


no ol evcor-wef n(n 
(8) 


but, in first establishing this under the restrictions on S mentioned 
above, we cannot replace ¢ in (8) by the linearized potential without 
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possibly introducing errors of the order of the squares of the distur- 


bances and hence of the same order as certain terms in (8). Normally, 
then, all we can say is that the y-component of force is, to a first 


approximation, ad od 
0? | (zea) x 


but that when it is clear that this integral vanishes for the exact 
solution ¢ (over some suitable surface surrounding the body) the 
correct expression is 


p,U } [m, [acvaye—aaee(2)') +- (aren, E22) 8] a8. (10) 


Expressions (8), (9), (10) all have the property that they are equal 
over any two surfaces between which the linearized equation holds. 














QUADRATIC FORMS NEAR TO 2?—27? 
. By P. VARNAVIDES (London) 
[Received 18 December 1948] 


1. Ler S(«,y) = ax?+bay+cy? (1) 
be any indefinite binary quadratic form with real coefficients, and 
let d = b?—4ac be its discriminant. A well-known theorem of 
Minkowski asserts that for any real numbers 2p, y) there are integers 


alia If (e-+ay, y+yo)| < Iva. (2) 
This result can be improved on for individual forms f(x,y). In 
particular, for the form f(x,y) = 2*—2y?, it was proved by Hein- 
hold} that one can satisfy 

Sette y+¥e)l < $= pv. (3) 
I have recently madet a more detailed study of the form f,(x, y) by 
a method due to Davenport and have proved, in particular, that 


aaes %) = 0 (mod 1), Yo = 4 (mod 1), (4) 
one can satisfy the more precise inequality 
oa bos 3% , 
| f(*+2%9,Y¥+Yo)| 3 t — 39 vd. (5) 


Thus the result (3) is isolated in the sense that a considerably better 
result is true unless a», yp are of the special form (4), in which case 
obviously (3) cannot be improved upon. 

My object in the present paper is to show that the result (3) is 
isolated in a stronger sense, in that a better result also applies for 
any form f(x,y) near to f,(x,y), unless f is a multiple of fy and 2, y, 
have the special form (4). By saying that the form (1) is ‘near’ to fo, 
we mean simply that |a—1}, |b|, |c+-2| are less than certain positive 
absolute constants. A convenient formulation is obtained if we 





suppose, as we may without loss of generality, that a = 1, and if we 
specify that the roots of the form shall be near V2 and —¥V2. The 
precise result obtained is as follows.$ 


+ J. Heinhold, Math. Zeits. 44 (1939), 659-88. 

t P. Varnavides, Proc. K. Ned. Akad. Wet. Amsterdam, 51 (1948), 396-404, 
470-81. 

§ A result of this kind was proved in my Ph.D. thesis (London 1948) by a 


different method, base¢ on Lemma 5 of Davenport’s paper in Acta Math. 
80 (1948), 65-95. The result, however, was less precise than that given here. 


(Quart. Journ. of Math. (Oxford), Vol. 20, June 1949, pp. 124-8] 
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THEOREM. Let g and h be any real numbers satisfying 


I<g<¥, [<h<¥. (6) 

Let f(x,y) = (@—gy)(a-+hy), (7) 

so that d = (g+h)*. Then, for any real xy, Yo, there exist integers x, y 
such that } } 
vd 24\ vd 

[f(w+2%,y¥+Yo)| < 5808 ~ (slave (8) 


unless g = h = V2 (so that f is fy) and xo, yo satisfy (4). 

The proof is based on a result of mine (see Lemma 1 below) which 
enables one to deduce a non-homogeneous inequality such as (8) from 
a knowledge of one suitable value of the homogeneous form. To 
obtain a suitable value of the homogeneous form (7), we use the 
classical process, going back to Gauss, of expanding g and h as 
continued fractions. For this purpose, we consider instead of f(x, y) 
the equivalent form 

f(xy, y) = {a—(9+ly}{x+ (h—1)y}. 

I wish to thank Professor Davenport for suggesting this approach 
to the problem. 

2. Lemma 1. Let f* be any value of \f(x,y)| which corresponds to 
co-prime integral values of x and y, and which satisfies 0 < f* < vd. 
Let «* be defined by 


d 
8 ie See 9 
”  P 
Then, for any real xo, Yo, there exist integers x, y such that 
f(at+axy,y+Y)| <4, (10) 
i? if of <f, 
of * “a Pa ae 9 
where AM = wi v ated. (11) 
ue if 4h <d, 
(tft if B<ak<h 


This is part of the theorem in my ‘Note on non-homogeneous 
quadratic forms’.t The result as expressed there involves the lower 
bound of numbers -@ such that (10) holds, but it is plain from the 
proof that (10) holds with @ itself. 


t Quart. J. of Math. (Oxford), 19 (1948), 54-8. 
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Lemma 2. Let} 











] 1 
1 = a,+——_ ——.... 
g+ toy a+ ? 
: : (12) 
h—1 =——-- ished 
@_4+ G.+ 
where all the a’s are positive integers. For any integer n, let 
Ee eee 
Ania Onset 
1 1 oa 
8, = a 


n Se ae eeee 
@,.4+ Aye 


Then one value f, of f(x,y) = (x—gy)(x+hy), corresponding to 
co-prime integral values of x and y, is given numerically by 

} 

ae i+4, (14) 

| Sn | 

This is a classical result in Gauss’s theory of the reduction of 

indefinite binary quadratic forms. For a proof, we may refer to 
Dickson’s Introduction to the Theory of Numbers §§ 64, 98. 


3. Proof of the theorem. 

By (6) we can expand g+1 and h—1 in the form (12), with ay = 2. 

Also, since 9 :; 
5 and a 3 
we must have a, = a, = 2 and a_, = a_, = 2. If a, = 2 for all n 
(positive and negative), then g = h = v2, and f is fy. In this case, 
as we saw in § 1, the more precise result (5) holds, unless x», yp are 
of the special form (4). 

We may therefore suppose that a, + 2 for some n. Let n be the 
least positive integer for which a, ,, + 2; if there is no such positive 
integer, the proof is essentially the same on considering the corre- 
sponding negative integer. We then have 


> 


Col 
~1/ Go 


n>2, G4, 42, @ =2 for —2ar<n. (15) 


By (13), the continued fraction for S, begins with at least four 


+ Either of these continued fractions may or may not terminate. In the 
case of a terminating continued fraction, we adopt the convention (as we may) 
that the last partial quotient shall be 1. 
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partial quotients, all 2, and that for S,,, begins with at least 
five partial quotients, all 2. It follows that 


1.e. 
Similarly 8 < Shir < %- 
1 1 


I write 0 = a, 4, +—_ 
‘ n+1 ane 
Onset Onigt 


and have Fi, = 6, J, = 25. (18) 


We consider various cases, according to the value of 6. Note that 
either @ >3 0rl<6< 2. 


Case 1. Suppose that 6 > 3-356. By (16) and (18), 
12 ] 17 
2+ — S. 24-——_ + — < 2-7127. 
39 <5 tS 2+3356+ a = 
We take f* in Lemma | to be | f,|, given by (14), and have 
of = te(F,+8,)?, 
whence } < a? < }. Hence, by Lemma 1, we can satisfy (10) with 


|, 
M = a2f* = (Fy +8,) 


vd vd 
~ T6 (27127) < 5308" 
Case 2. Suppose that 3 < @ < 3-356. By (16) and (18), 
3+ < Fras tSpir < 3:356+%, 
whence 3°41 < F,,,+8,4, < 3-771. 
Taking f* to be |f,,,,|, we have 
4 (3-41)? < a2 < 4(3-771)2, 
so that $< a? < }. 
By the third and fourth clauses in Lemma 1, we get 
vd 


AM an bf? on eg 
? 2(FriatS),+1) 


washed Bf? on oe : -EEH 
AM = (a t)f , 76 Fess Sus) (Fiza tSn+1) 





128 QUADRATIC FORMS NEAR TO 2?—2y? 


according as +;(F,,,,+S,4,)? is less than or greater than 3. In the 


former case we obtain 
/ 
_ 
Mer: 
2(3-41) 
and in the latter case, since the expression in brackets increases with 
Fi tS, we obtain 
/ 
NE am 
3°77 
S 76 
Case 3. Suppose that 1-2 < @ < y (16) « (18), 
91 41 ¢ 
12-99 < F, “aT 


whence 1-61 < F,,,+8,, 


Taking f* to be | f,,,|, we have 
#y(1-61)? < 
, Lemma 1 gives 
vd 
~ 4(F, at Sn ‘n-44) 4(1-61)" 
4, Lemma ! gives 
M = f* = = (Fria tn) 


ae 14143) < 4 
6-6 


Case 4. Suppose that 1 < @ < 1-2. By (16) and (18), 


‘ 
» +S - 3 ee 
“+75 —5+55 =5 < n™ +7 


whence 3°247 < F.+S8S, < 3-4147. 
Taking f* = | f,,|, we have $ < a* < #, and so, by Lemma 1, 
vd vd 
2(F,+S,) ~ 6-49" 
In every case, apart from the exception specified at the beginning, 
we have shown that (10) is satisfied with 


Vd 
MAM << —— 
5-898’ 


M =}f* = 


and so the proof of the theorem is complete. 
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